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To My Many Teachers 



PREFACE 


Applied mathematics is largely concerned with the design and analysis of 
explicit procedures for calculating the exact or approximate values of 
various functions. Such explicit procedures are called algorithms or 
programs. Because an effective notation for the description of programs 
exhibits considerable syntactic structure, it is called a programming 
language. 

Much of applied mathematics, particularly the more recent computer- 
related areas which cut across the older disciplines, suffers from the lack 
of an adequate programming language. It is the central thesis of this book 
that the descriptive and analytic power of an adequate programming 
language amply repays the considerable effort required for its mastery. 
This thesis is developed by first presenting the entire language and then 
applying it in later chapters to several major topics. 

The areas of application are chosen primarily for their intrinsic interest 
and lack of previous treatment, but they are also designed to illustrate the 
universality and other facets of the language. For example, the micro¬ 
programming of Chapter 2 illustrates the divisibility of the language, i.e., 
the ability to treat a restricted area using only a small portion of the 
complete language. Chapter 6 (Sorting) shows its capacity to compass a 
relatively complex and detailed topic in a short space. Chapter 7 (The 
Logical Calculus) emphasizes the formal manipulability of the language 
and its utility in theoretical work. 

The material was developed largely in a graduate course given for 
several years at Harvard and in a later course presented repeatedly at the 
IBM Systems Research Institute in New York. It should prove suitable 
for a two-semester course at the senior or graduate level. Although for 
certain audiences an initial presentation of the entire language may be 
appropriate, I have found it helpful to motivate the development by 
presenting the minimum notation required for a given topic, proceeding 
to its treatment (e.g., microprogramming), and then returning to further 
notation. The 130-odd problems not only provide the necessary finger 
exercises but also develop results of general interest. 
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Chapter 1 or some part of it is prerequisite to each of the remaining 
“applications” chapters, but the applications chapters are virtually 
independent of one another. A complete appreciation of search techniques 
(Chapter 4) does, however, require a knowledge of methods of representa¬ 
tion (Chapter 3). The cross references which do occur in the applications 
chapters are either nonessential or are specific to a given figure, table, or 
program. The entire language presented in Chapter 1 is summarized for 
reference at the end of the book. 

In any work spanning several years it is impossible to acknowledge 
adequately the many contributions made by others. Two major acknowl¬ 
edgments are in order: the first to Professor Howard Aiken, Director 
Emeritus of the Harvard Computation Laboratory, and the second to 
Dr. F. P. Brooks, Jr. now of IBM. 

It was Professor Aiken who first guided me into this work and who 
provided support and encouragement in the early years when it mattered. 
The unusually large contribution by Dr. Brooks arose as follows. Several 
chapters of the present work were originally prepared for inclusion in a 
joint work which eventually passed the bounds of a single book and 
evolved into our joint Automatic Data Processing and the present volume. 
Before the split, several drafts of these chapters had received careful 
review at the hands of Dr. Brooks, reviews which contributed many 
valuable ideas on organization, presentation, and direction of investiga¬ 
tion, as well as numerous specific suggestions. 

The contributions of the 200-odd students who suffered through the 
development of the material must perforce be acknowledged collectively, 
as must the contributions of many of my colleagues at the Harvard 
Computation Laboratory. To Professor G. A. Salton and Dr. W. L. 
Eastman, I am indebted for careful reading of drafts of various sections 
and for comments arising from their use of some of the material in courses. 
Dr. Eastman, in particular, exorcised many subtle errors from the sorting 
programs of Chapter 6. To Professor A. G. Oettinger and his students 
I am indebted for many helpful discussions arising out of his early use of 
the notation. My debt to Professor R. L. Ashenhurst, now of the Univer¬ 
sity of Chicago, is apparent from the references to his early (and un¬ 
fortunately unpublished) work in sorting. 

Of my colleagues at the IBM Research Center, Messrs. L. R. Johnson 
and A. D. Falkoff, and Dr. H. Hellerman have, through their own use of 
the notation, contributed many helpful suggestions. I am particularly 
indebted to L. R. Johnson for many fruitful discussions on the applications 
of trees, and for his unfailing support. 

On the technical side, I have enjoyed the assistance of unusually compe¬ 
tent typists and draughtsmen, chief among them being Mrs. Arthur 
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Aulenback, Mrs. Philip J. Seaward, Jr., Mrs. Paul Bushek, Miss J. L. 
Hegeman, and Messrs. William Minty and Robert Burns. Miss Jacquelin 
Sanborn provided much early and continuing guidance in matters of style, 
format, and typography. I am indebted to my wife for assistance in 
preparing the final draft. 

Kenneth E. Iverson 

May , 1962 

Mount Kisco, New York 
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1 

THE LANGUAGE 


1.1 INTRODUCTION 

Applied mathematics is concerned with the design and analysis of 
algorithms or programs. The systematic treatment of complex algorithms 
requires a suitable programming language for their description, and such a 
programming language should be concise, precise, consistent over a wide 
area of application, mnemonic, and economical of symbols; it should 
exhibit clearly the constraints on the sequence in which operations are 
performed; and it should permit the description of a process to be inde¬ 
pendent of the particular representation chosen for the data. 

Existing languages prove unsuitable for a variety of reasons. Computer 
coding specifies sequence constraints adequately and is also comprehensive, 
since the logical functions provided by the branch instructions can, in 
principle, be employed to synthesize any finite algorithm. However, the 
set of basic operations provided is not, in general, directly suited to the 
execution of commonly needed processes, and the numeric symbols used 
for variables have little mnemonic value. Moreover, the description 
provided by computer coding depends directly on the particular represen¬ 
tation chosen for the data, and it therefore cannot serve as a description of 
the algorithm per se. 

Ordinary English lacks both precision and conciseness. The widely used 
Goldstine-von Neumann (1947) flowcharting provides the conciseness 
necessary to an over-all view of the process, only at the cost of suppressing 
essential detail. The so-called pseudo-English used as a basis for certian 
automatic programming systems suffers from the same defect. Moreover, 
the potential mnemonic advantage in substituting familiar English words 
and phrases for less familiar but more compact mathematical symbols fails 
to materialize because of the obvious but unwonted precision required in 
their use. 

Most of the concepts and operations needed in a programming language 
have already been defined and developed in one or another branch of 
mathematics. Therefore, much use can and will be made of existing 
notations. However, since most notations are specialized to a narrow 
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The language § 1.1 

field of discourse, a consistent unification must be provided. For example, 
separate and conflicting notations have been developed for the treatment 
of sets, logical variables, vectors, matrices, and trees, all of which may, in 
the broad universe of discourse of data processing, occur in a single 
algorithm. 

1.2 PROGRAMS 

A program statement is the specification of some quantity or quantities 
in terms of some finite operation upon specified operands. Specification is 
symbolized by an arrow directed toward the specified quantity. Thus “y 
is specified by sin x” is a statement denoted by 

y sin x. 

A set of statements together with a specified order of execution consti¬ 
tutes a program. The program is finite if the number of executions is 
finite. The results of the program are some subset of the quantities 
specified by the program. The sequence or order of execution will be 
defined by the order of listing and otherwise by arrows connecting any 
statement to its successor. A cyclic sequence of statements is called a loop. 




1 

z <- 1 


1 

v <- x x 3.1416 

2 

z <-y x z 


2 

V V X X 

3 

z <- 2 X z 



Program 1.1 Finite Program 1.2 Infinite 

program program 


Thus Program 1.1 is a program of two statements defining the result v as 
the (approximate) area of a circle of radius x, whereas Program 1.2 is an 
infinite program in which the quantity z is specified as (2 y) n on the «th 
execution of the two-statement loop. Statements will be numbered on the 
left for reference. 

A number of similar programs may be subsumed under a single more 
general program as follows. At certain branch points in the program a 
finite number of alternative statements are specified as possible successors. 
One of these successors is chosen according to criteria determined in the 
statement or statements preceding the branch point. These criteria are 
usually stated as a comparison or test of a specified relation between a 
specified pair of quantities. A branch is denoted by a set of arrows leading 
to each of the alternative successors, with each arrow labeled by the 
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comparison condition under which the corresponding successor is chosen. 
The quantities compared are separated by a colon in the statement at the 
branch point, and a labeled branch is followed if and only if the relation 
indicated by the label holds when substituted for the colon. The conditions 
on the branches of a properly defined program must be disjoint and 
exhaustive. 

Program 1.3 illustrates the use of a branch point. Statement a5 is a 
comparison which determines the branch to statements (51, or y 1, 
according as z > n, z = n, or z < n. The program represents a crude but 
effective process for determining x = n A for any positive cube n. 



Program 1.4 shows the preceding program reorganized into a compact 
linear array and introduces two further conventions on the labeling of 
branch points. The listed successor of a branch statement is selected if 
none of the labeled conditions is met. Thus statement 6 follows statement 
5 if neither of the arrows (to exit or to statement 8) are followed, i.e., if 
z < n. Moreover, any unlabeled arrow is always followed; e.g., statement 
7 is invariably followed by statement 3, never by statement 8. 

A program begins at a point indicated by an entry arrow (step 1) and ends 
at a point indicated by an exit arrow (step 5). There are two useful 
consequences of confining a program to the form of a linear array: the 
statements may be referred to by a unique serial index (statement number), 
and unnecessarily complex organization of the program manifests itself in 
crossing branch lines. The importance of the latter characteristic in 
developing clear and comprehensible programs is not sufficiently appre¬ 
ciated. 
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Program 1.4 Linear arrangement of Program 1.3 



A process which is repeated a number of times is said to be iterated , and 
a process (such as Program 1.4) which includes one or more iterated 
subprocesses is said to be iterative . Program 1.5 shows an iterative process 
for the matrix multiplication 


C AB 





1.2 

defined in the usual way as 


Programs 5 


a 


v(^) 

= 2 

k= 1 


x B; 


i = 1 , 2 ,... 

J= 1,2,..., i<B), 


where the dimension of an m x n rectangular matrix X (of m rows and n 
columns) is denoted by /u(X ) x v(X). 

Program 1.5. Steps 1-3 initialize the indices, and the loop 5-7 continues to 
add successive products to the partial sum until k reaches zero. When this 
occurs, the process continues through step 8 to decrement j and to repeat the 
entire summation for the new value of y, providing that it is not zero. If j is zero, 
the branch to step 10 decrements / and the entire process over y and k is repeated 
from j = v(B ), providing that / is not zero. If / is zero, the process is complete, 
as indicated by the exit arrow. 

In all examples used in this chapter, emphasis will be placed on clarity 
of description of the process, and considerations of efficient execution by a 
computer or class of computers will be subordinated. These considerations 
can often be introduced later by relatively routine modifications of the 
program. For example, since the execution of a computer operation 
involving an indexed variable is normally more costly than the corre¬ 
sponding operation upon a nonindexed variable, the substitution of a 
variable s for the variable Cf specified by statement 5 of Program 1.5 
would accelerate the execution of the loop. The variable s would be 
initialized to zero before each entry to the loop and would be used to 
specify C/ at each termination. 

The practice of first setting an index to its maximum value and then 
decrementing it (e.g., the index k in Program 1.5) permits the termination 
comparison to be made with zero. Since zero often occurs in comparisons, 
it is convenient to omit it. Thus, if a variable stands alone at a branch 
point, comparison with zero is implied. Moreover, since a comparison on 
an index frequently occurs immediately after it is modified, a branch at the 
point of modification will denote branching upon comparison of the 
indicated index with zero, the comparison occurring after modification. 
Designing programs to execute decisions immediately after modification of 
the controlling variable results in efficient execution as well as notational 
elegance, since the variable must be present in a central register for both 
operations. 

Since the sequence of execution of statements is indicated by connecting 
arrows as well as by the order of listing, the latter can be chosen arbitrarily. 
This is illustrated by the functionally identical Programs 1.3 and 1.4. 
Certain principles of ordering may yield advantages such as clarity or 
simplicity of the pattern of connections. Even though the advantages of a 
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particular organizing principle are not particularly marked, the uniformity 
resulting from its consistent application will itself be a boon. The scheme 
here adopted is called the method of leading decisions: the decision on each 
parameter is placed as early in the program as practicable, normally just 
before the operations indexed by the parameter. This arrangement groups 
at the head of each iterative segment the initialization, modification, and 
the termination test of the controlling parameter. Moreover, it tends to 
avoid program flaws occasioned by unusual values of the arguments. For 



Program 1.6 Matrix multiplication using leading decisions 

example, Program 1.6 (which is such a reorganization of Program 1.5) 
behaves properly for matrices of dimension zero, whereas Program 1.5 
treats every matrix as if it were of dimension one or greater. 

Although the labeled arrow representation of program branches 
provides a complete and graphic description, it is deficient in the following 
respects: (1) a routine translation to another language (such as a computer 
code) would require the tracing of arrows, and (2) it does not permit 
programmed modification of the branches. 

The following alternative form of a branch statement will therefore be 
used as well: 

x \y, r^s. 

This denotes a branch to statement number s i of the program if the relation 
xr t y holds. The parameters r and s may themselves be defined and re¬ 
defined in other parts of the program. The null element o will be used to 
denote the relation which complements the remaining relations in 
particular, (o) —► (s), or simply —► s , will denote an unconditional branch to 
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statement s. Program 1.7 shows the use of these conventions in a reformu¬ 
lation of Program 1.6. More generally, two or more otherwise independent 
programs may interact through a statement in one program specifying a 
branch in a second. The statement number occurring in the branch must 
then be augmented by the name of the program in which the branch is 
effected. Thus the statement (o) Program 2.24 executed in Program 1 
causes a branch to step 24 to occur in Program 2. 



Program 1.7 A reformulation of Program 1.6, using an algebraic 
statement of the branching 

One statement in a program can be modified by another statement 
which changes certain of its parameters, usually indices. More general 
changes in statements can be effected by considering the program itself as a 
vector p whose components are the individual, serially numbered state¬ 
ments. All the operations to be defined on general vectors can then be 
applied to the statements themselves. For example, theyth statement can 
be respecified by the /th through the occurrence of the statement p, <— p t . 

The interchange of two quantities y and x (that is, x specifies y and the 
original value of y specifies x) will be denoted by the statement y x. 

1.3 STRUCTURE OF THE LANGUAGE 

Conventions 

The Summary of Notation at the end of the book summarizes the nota¬ 
tion developed in this chapter. Although intended primarily for reference, 
it supplements the text in several ways. It frequently provides a more 
concise alternative definition of an operation discussed in the text, and it 
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also contains important but easily grasped extensions not treated explicitly 
in the text. By grouping the operations into related classes it displays 
their family relationships. 

A concise programming language must incorporate families of opera¬ 
tions whose members are related in a systematic manner. Each family will 
be denoted by a specific operation symbol, and the particular member of the 
family will be designated by an associated controlling parameter (scalar, 
vector, matrix, or tree) which immediately precedes the main operation 
symbol. The operand is placed immediately after the main operation 
symbol. For example, the operation k \ x (left rotation of x by k places) 
may be viewed as the /:th member of the set of rotation operators denoted 
by the symbol f. 

Operations involving a single operand and no controlling parameter 
(such as [x\, or \x]) will be denoted by a pair of operation symbols which 
enclose the operand. Operations involving two operands and a controlling 
parameter (such as the mask operation /a, u, b/) will be denoted by a pair 
of operation symbols enclosing the entire set of variables, and the con¬ 
trolling parameter will appear between the two operands. In these cases 
the operation symbols themselves serve as grouping symbols. 

In interpreting a compound operation such as k j (j j x) it is important to 
recognize that the operation symbol and its associated controlling param¬ 
eter together represent an indivisible operation and must not be separated. 
It would, for example, be incorrect to assume that j \ (k j #) were 
equivalent to k \ {j j x), although it can be shown that the complete opera¬ 
tions j | and k \ do commute, that is, k \ (j \ x) = j \ (k \ x). 

The need for parentheses will be reduced by assuming that compound 
statements are, except for intervening parentheses, executed from right to 
left. Thus k \ j j x is equivalent to k \ (j j x), not to (k f j) j x. 

Structured operands such as vectors and matrices, together with a 
systematic component-by-component generalization of elementary opera¬ 
tions, provide an important subordination of detail in the description of 
algorithms. The use of structured operands will be facilitated by selection 
operations for extracting a specified portion of an operand, reduction 
operations for extending an operation (such as logical or arithmetic 
multiplication) over all components, and permutation operations for 
reordering components. Operations defined on vectors are extended to 
matrices: the extended operation is called a row operation if the under¬ 
lying vector operation is applied to each row of the matrix and a column 
operation if it is applied to each column. A column operation is denoted 
by doubling the symbol employed for the corresponding row (and vector) 
operation. 

A distinct typeface will be used for each class of operand as detailed in 
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Table 1.8. Special quantities (such as the prefix vectors a j defined in Sec. 
1.7) will be denoted by Greek letters in the appropriate typeface. For 
mnemonic reasons, an operation closely related to such a special quantity 


Type of 

Representation 

Operand 

Printed 

Typed 

Literal 

Alphabetic 

Numeric 

Roman, u.c. and l.c. 
Standard numeral 

Circled u.c. and l.c. roman. 
Standard numeral 

Variable 

Alphabetic 

Numeric 

Italic, u.c. and l.c. 

Italic numeral 

Unmarked 

Underscore 

Vector 

l.c. boldface italic 

Underscore 

Matrix 

u.c. boldface italic 

Underscore 

Tree 

u.c. boldface roman 

Wavy underscore 


Table 1.8 Typographic conventions for classes of operands 


will be denoted by the same Greek letter. For example, a/fi denotes the 
maximum prefix (Sec. 1.10) of the logical vector u. Where a Greek letter 
is indistinguishable from a Roman, sanserif characters will be used, e.g., 
£ and / for the capitals epsilon and iota. 

Literals and variables 

The power of any mathematical notation rests largely on the use of 
symbols to represent general quantities which, in given instances, are 
further specified by other quantities. Thus Program 1.4 represents a 
general process which determines x = n /z for any suitable value of n. In a 
specific case, say n = 27, the quantity x is specified as the number 9. 

Each operand occurring in a meaningful process must be specified 
ultimately in terms of commonly accepted concepts. The symbols 
representing such accepted concepts will be called literals . Examples of 
literals are the integers, the characters of the various alphabets, punctua¬ 
tion marks, and miscellaneous symbols such as $ and %. The literals 
occurring in Program 1.4 are 0, 1, and 2. 

It is important to distinguish clearly between general symbols and 
literals. In ordinary algebra this presents little difficulty, since the only 
literals occurring are the integers and the decimal point, and each general 
symbol employed includes an alphabetic character. In describing more 
general processes, however, alphabetic literals (such as proper names) also 
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appear. Moreover, in a computer program, numeric symbols (register 
addresses) are used to represent the variables. 

In general, then, alphabetic literals, alphabetic variables, numeric 
literals, and numeric variables may all appear in a complex process and 
must be clearly differentiated. The symbols used for literals will be roman 
letters (enclosed in quotes when appearing in text) and standard numerals. 
The symbols used for variables will be italic letters, italic numerals, and 
boldface letters as detailed in Table 1.8. Miscellaneous signs and symbols 
when used as literals will be enclosed in quotes in both programs and text. 

It is sometimes desirable (e.g., for mnemonic reasons) to denote a 
variable by a string of alphabetic or other symbols rather than by a single 
symbol. The monolithic interpretation of such a string will be indicated 
by the tie used in musical notation, thus: inv, inv , and INV may denote 

the variable “inventory,” a vector of inventory values, and a matrix of 
inventory values, respectively. 

In the set of alphabetic characters, the space plays a special role. For 
other sets a similar role is usually played by some one element, and this 
element is given the special name of null element. In the set of numeric 
digits, the zero plays a dual role as both null element and numeric quantity. 
The null element will be denoted by the degree symbol o. 

In any determinate process, each operand must be specified ultimately in 
terms of literals. In Program 1.4, for example, the quantity k is specified 
in terms of known arithmetic operations (multiplication and division) 
involving the literals 1 and 2. The quantity n, on the other hand, is not 
determined within the process and must presumably be specified within 
some larger process which includes Program 1.4. Such a quantity is called 
an argument of the process. 

Domain and range 

The class of arguments and the class of results of a given operator are 
called its domain and range, respectively. Thus the domain and range of 
the magnitude operation (\x\) are the real numbers and the nonnegative 
real numbers, respectively. 

A variable is classified according to the range of values it may assume: 
it is logical , integral , or numerical , according as the range is the set of 
logical variables (that is, 0 and 1), the set of integers, or the set of real 
numbers. Each of the foregoing classes is clearly a subclass of each class 
following it, and any operation defined on a class clearly applies to any of 
its subclasses. A variable which is nonnumeric will be called arbitrary. In 
the Summary of Notation, the range and domain of each of the operators 
defined is specified in terms of the foregoing classes according to the 
conventions shown in Sec. S.l. 
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1.4 ELEMENTARY OPERATIONS 

The elementary operations employed include the ordinary arithmetic 
operations, the elementary operations of the logical calculus, and the 
residue and related operations arising in elementary number theory. In 
defining operations in the text, the symbol o will be used to denote 
equivalence of the pair of statements between which it occurs. 

Arithmetic operations 

The ordinary arithmetic operations will be denoted by the ordinary 
symbols +, —, x, and ~ and defined as usual except that the domain and 
range of multiplication will be extended slightly as follows. If one of the 
factors is a logical variable (0 or 1), the second may be arbitrary and the 
product then assumes the value of the second factor or zero according as 
the value of the first factor (the logical variable) is 1 or 0. Thus if the 
arbitrary factor is the literal “q,” then 

0 x q = q x 0 = 0 
and 1 x q = q x 1 = q. 

According to the usual custom in ordinary algebra, the multiplication 
symbol may be elided. 

Logical operations 

The elementary logical operations and , or , and not will be denoted by A , 
V, and an overbar and are defined in the usual way as follows: 

w <— u A vow =1 if and only if u — 1 and v — 1, 

w <— u v vow — 1 if and only if u = 1 or v = 1, 

w <— u ow =1 if and only if u = 0. 

If x and y are numerical quantities, then the expression x < y implies 

that the quantity x stands in the relation “less than” to the quantity y. 
More generally, if a and ft are arbitrary entities and 3# is any relation 
defined on them, the relational statement (cf.3tft) is a logical variable which 
is true (equal to 1) if and only if a stands in the relation M to ft. For 
example, if x is any real number, then the function 

(x > 0) — (x < 0) 

(commonly called the sign function or sgn x) assumes the values 1, 0, or 
— 1 according as x is strictly positive, 0, or strictly negative. Moreover, 
the magnitude function \x\ may be defined as \x\ = x x sgn x = x x 
((x > 0) — (x < 0)). 
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The relational statement is a useful generalization of the Kronecker 
delta, that is, df = (/ = y). Moreover, it provides a convenient expression 
for a number of familiar logical operations. The exclusive or , for example, 
may be denoted by (u ^ v ), and its negation (i.e., the equivalence function) 
may be denoted by (w = v). 

Residues and congruence 

For each set of integers «,y, and b , with b > 0, there exists a unique pair 
of integers q and r such that 

n = bq + r, j < r < j + b. 

The quantity r is called the j-residue of n modulo b and is denoted by b | ; n. 
For example, 3 | 0 9 = 0, 3 | x 9 = 3, and, 3 | 0 10 = 1. Moreover, if n > 0, 
then b | 0 n is the remainder obtained in dividing n by b and q is the integral 
part of the quotient. A number n is said to be of even parity if its 0-residue 
modulo 2 is zero and of odd parity if 2 | 0 n = 1. 

If two numbers n and m have the same y-residue modulo b , they differ 
by an integral multiple of b and therefore have the same fc-residue modulo 
b for any k. If b |,- n = b | ; m, then m and n are said to be congruent modb. 
Congruency is transitive and reflexive and is denoted by 

m = n (mod b). 

In classical treatments, such as Wright (1939), only the 0-residue is 
considered. The use of 1-origin indexing (cf. Sec. 1.5) accounts for the 
interest of the 1-residue. 

A number represented in a positional notation (e.g., in a base ten or a 
base two number system) must, in practice, employ only a finite number of 
digits. It is therefore often desirable to approximate a number x by an 
integer. For this purpose two functions are defined: 

1. th e floor of x (or integral part of x), denoted by [x\ and defined as the 
largest integer not exceeding x, 

2. the ceiling of x, denoted by \x] and defined as the smallest integer not 
exceeded by x. 

Thus 

[3.141 = 4, |3.14j = 3, [-3.14J = -4, 

[3.001 = 3, [3.00J = 3, [-3.00J = -3. 

Clearly [»1 = — [— x\ and \x\ < x < \z]. Moreover, n = b\n b\ + b | 0 n 
for all integers n. Hence the integral quotient [« b\ is equivalent to the 
quantity q occurring in the definition of the y-residue for the case j = 0. 
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1.5 STRUCTURED OPERANDS 

Elementary operations 

Any operation defined on a single operand can be generalized to apply 
to each member of an array of related operands. Similarly, any binary 
operation (defined on two operands) can be generalized to apply to pairs 
of corresponding elements of two arrays. Since algorithms commonly 
incorporate processes which are repeated on each member of an array of 
operands, such generalization permits effective subordination of detail in 
their description. For example, the accounting process defined on the 
data of an individual bank account treats a number of distinct operands 
within the account, such as account number, name, and balance. More¬ 
over, the over-all process is defined on a large number of similar accounts, 
all represented in a common format. Such structured arrays of variables 
will be called structured operands , and extensive use will be made of three 
types, called vector , matrix , and tree. As indicated in Sec. S.l of the 
Summary of Notation, a structured operand is further classified as logical , 
integral , numerical , or arbitrary , according to the type of elements it 
contains. 

A vector x is the ordered array of elements (%, x 2 , x 3 , ... , x v{x) ). The 
variable x { is called the zth component of the vector x, and the number of 
components, denoted by v(x) (or simply v when the determining vector is 
clear from context), is called the dimension of x. Vectors and their com¬ 
ponents will be represented in lower case boldface italics. A numerical 
vector x may be multiplied by a numerical quantity k to produce the 
scalar multiple k X x (or kx) defined as the vector z such that z t = k x x { . 

All elementary operations defined on individual variables are extended 
consistently to vectors as component-by-component operations. For 
example, 

* = * + y<=>z { = x t + y i9 
Z = X X yoz { = X , x y i9 
Z = * -=- yoz t = ^T y i9 
z = [* 1 ^ 2 , = \x t ], 
w — u A vow i = u t A V ( , 
w = (x < y)ow i = (Xt < y { ). 

Thus if * = (1, 0, 1, 1) and y — (0, 1, 1, 0) then x + y = (1, 1, 2, 1), 
x A y = (0, 0, 1, 0), and (x < y) = (0, 1, 0, 0). 
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Matrices 

A matrix M is the ordered two-dimensional array of variables 

Im, 1 , m 2 \ m x HM) 

M t 2 , M 2 , M 2 {M) 

The vector (M ± \ M 2 \ . . . , Mj) is called the /th row vector of M and is 
denoted by M\ Its dimension v(M) is called the row dimension of the 
matrix. The vector (M/, Mf ,. . ., Mf) is called the yth column vector 
of M and is denoted by M jt Its dimension ju(M) is called the column 
dimension of the matrix. 

The variable Mf is called the (/,/)th component or element of the matrix. 
A matrix and its elements will be represented by upper case boldface 
italics. Operations defined on each element of a matrix are generalized 
component by component to the entire matrix. Thus, if O is any binary 

operator, P — M Q No P/ = Mf o N/. 

Index systems 

The subscript appended to a vector to designate a single component is 
called an index , and the indices are normally chosen as a set of successive 
integers beginning at 1, that is, x = (x l9 x 2 ,. . . , x v ). It is, however, 
convenient to admit more general j-origin indexing in which the set of 
successive integers employed as indices in any structured operand begin 
with a specified integer j. 

The two systems of greatest interest are the common 1-origin system, 
which will be employed almost exclusively in this chapter, and the 0-origin 
system. The latter system is particularly convenient whenever the index 
itself must be represented in a positional number system and will therefore 
be employed exclusively in the treatment of computer organization in 
Chapter 2. 

1.6 ROTATION 

The left rotation of a vector x is denoted by k f x and specifies the 
vector obtained by a cyclical left shift of the components of x by k places. 
Thus if a = (1,2, 3, 4, 5, 6), and h = (c, a, n, d, y), then 2 \ a — 
(3, 4, 5, 6, 1, 2), and 3 f b = 8 \ b = (d, y, c, a, n). Formally,* 

z = k f xoz t = Xj, where j = v\fi + k). 

* Restating the relation in terms of the 0-residue will illustrate the convenience of the 
1-residue used here. 
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Right rotation is denoted by k j x and is defined analogously. Thus 


z = k | xoz i = x j9 where j = v\fi — k). 


If k — 1, it may be elided. Thus \ b = (a, n, d, y, c). 

Left rotation is extended to matrices in two ways as follows: 

A +-j f Bo A* = j l f B i 
C<-kf\BoC j = k, f JB 5 , 


The first operation is an extension of the basic vector rotation to each row 
of the matrix and is therefore called row rotation. The second operation is 
the corresponding column operation and is therefore denoted by the 
doubled operation symbol ff. For example, if 

* = ( 0 , 1 , 2 ), 

and 


then 



k t B 



and 



Right rotation is extended analogously. 


1.7 SPECIAL VECTORS 

Certain special vectors warrant special symbols. In each of the following 
definitions, the parameter n will be used to specify the dimension. The 
interval vector \?{n) is defined as the vector of integers beginning with j. 
Thus i°(4) = (0, 1, 2, 3), 13(4) = (1, 2, 3, 4), and i“ 7 (5) = (-7, -6, -5, 
—4, —3). Four types of logical vectors are defined as follows. The yth 
unit vector e j (n) has a one in the yth position, that is, (€ j (n)) k = (k = /). 
The full vector z{ri) consists of all ones. The vector consisting of all zeros 
is denoted both by 0 and by €(«). The prefix vector of weight j is denoted 
by OL j (ri) and possesses ones in the first k positions, where k is the lesser of j 
and n. The suffix vector c o j (n) is defined analogously. Thus € 2 (3) = 
(0, 1, 0),€(4) = (1, 1, 1, l),a 3 (5) = (1, 1, 1,0, 0),co 3 (5) = (0,0, 1, 1, l),and 
a 7 (5) = a 5 (5) = (1, 1, 1, 1, 1). Moreover, c o j (n) = j f a j (n), and a j (n) = 
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A logical vector of the form a h (n) A t o 7 (n) is called an infix vector. An 
infix vector can also be specified in the form j j OL k (n), which displays its 
weight and location more directly. 

An operation such as x a y is defined only for compatible vectors x and 
y , that is, for vectors of like dimension. Since this compatibility 
requirement can be assumed to specify implicitly the dimension of one of 
the operands, elision of the parameter n may be permitted in the notation 
for the special vectors. Thus, if y = (3, 4, 5, 6, 7), the expressions e x y 
and e j x y imply that the dimensions of € and e j are both 5. Moreover, 
elision of j will be permitted for the interval vector i j (n) (or E), and for the 
residue operator when j is the index origin in use. 

It is, of course, necessary to specify the index origin in use at any given 
time. For example, the unit vector € 3 (5) is (0,0, 1,0,0) in a 1-origin 
system and (0, 0, 0, 1, 0) in a 0-origin system, even though the definition 
(that is, (€ j (n)) k = (k = j)) remains unchanged. The prefix and suffix 
vectors are, of course, independent of the index origin. Unless otherwise 
specified, 1-origin indexing will be assumed. 

The vector e(0) is a vector of dimension zero and will be called the null 
vector. It should not be confused with the special null element o. 

1.8 REDUCTION 

An operation (such as summation) which is applied to all components 
of a vector to produce a result of a simpler structure is called a reduction. 
The O-reduction of a vector x is denoted by o/x and defined as 

z O/xoz = (• • • ((aq O x 2 ) O x 3 ) O * • *) O O, 

where O is any binary operator with a suitable domain. Thus +/x is the 
sum, x lx is the product, and v /x is the logical sum of the components of 
a vector x. For example, x/i3(5) = 1 x 2 x 3 x 4 x 5, x/i\n) = nl, 
and +I0(n) = n(n + l)/2. 

As a further example, De Morgan’s law may be expressed as A )u— 
V /u, where u is a logical vector of dimension two. Moreover, a simple 
inductive argument (Exercise 1.10) shows that the foregoing expression is the 
valid generalization of De Morgan’s law for a logical vector u of arbitrary 
dimension. 

A relation & incorporated into a relational statement ( x0fy ) becomes, 
in effect, an operator on the variables x and y. Consequently, the reduction 
ffljx can be defined in a manner analogous to that of o/x, that is, 

&/x = (• • • ((x 1 ffix 2 )&x 3 )3? • • • )&x l ). 

The parentheses now imply relational statements as well as grouping. 
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The relational reductions of practical interest are ^/u, and = /u, the 
exclusive-or and the equivalence reduction, respectively. 

The inductive argument of Exercise 1.10 shows that ^ju = 2 | 0 (+/u). 
For example, if u = (1, 0, 1, 1, 0), then 

= ((((1 ^ 0 ) ^ 1 ) # 1 ) ^ 0 ) 

= (((1 ^ 1 )^ 1 )^ 0 ) 

= ((0 ^ 1 )^ 0 ) 

= (1^0)= 1, 

and 2 1 0 (+/u) = 2 | 0 3 = 1. Similarly, =/u = 2 | 0 (+/w), and as a con¬ 
sequence, 

^ l u = =/5, 

a useful companion to De Morgan’s law. 

To complete the system it is essential to define the value of o/c(0), the 
reduction of the null vector of dimension zero, as the identity element of 
the operator or relation O. Thus +/e(0) = v/e(0) = 0, and x/e(0) = 
A/€(0)= 1. 

A reduction operation is extended to matrices in two ways. A row 
reduction of a matrix X by an operator O is denoted by 

y-o/x 

and specifies a vector y of dimension /ll(X ) such that y t = O /X*. A 
column reduction of X is denoted by z +- o//X and specifies a vector z of 
dimension v(X) such that z j = o/X jm 
For example, if 

( 10 10 
0 0 11 
1110 

then + /U = (2, 2, 3), +//U = (2, 1, 3, 1), a//I7 = (0,0, 1,0), 7 ^/^ = 
(0, 0, 1), =//U = (0, 1, 1, 1), and +/(=//17) = 3. 

1.9 SELECTION 
Compression 

The effective use of structured operands depends not only on generalized 
operations but also on the ability to specify and select certain elements or 
groups of elements. The selection of single elements can be indicated by 
indices, as in the expressions v-, AT, M j9 and A//. Since selection is a 
binary operation (i.e., to select or not to select), more general selection is 
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conveniently specified by a logical vector, each unit component indicating 
selection of the corresponding component of the operand. 

The selection operation defined on an arbitrary vector a and a compat¬ 
ible (i.e., equal in dimension) logical vector u is denoted by c +- uja and 
is defined as follows: the vector c is obtained from a by suppressing from 
a each component a { for which u i = 0. The vector u is said to compress 
the vector a. Clearly v{c) = +/u. For example, if u = (1, 0, 0, 0, 1, 1) 
and a = (M, o, n, d, a, y), then uja = (M, a, y). Moreover, if n is even 
and v = (2e) | 0 i \n) = (1,0, 1,0, 1then vji\n) = (1, 3, 51), 
and +j{vji\ri)) = (nj If. 

Row compression of a matrix, denoted by u/A , compresses each row 
vector A 1 ' to form a matrix of dimension /u(A) x +/ti. Column compres¬ 
sion , denoted by u//A, compresses each column vector A j to form a 
matrix of dimension -f/u x v(A). Compatibility conditions are v(u ) = 
v(A) for row compression, and v{u) = ju(A) for column compression. 
For example, if A is an arbitrary 3x4 matrix, u = (0, 1,0, 1) and 
v = (1, 0, 1); then 



and u/vl/A = v/lti/A = I s ^ 

It is clear that row compression suppresses columns corresponding to 
zeros of the logical vector and that column compression suppresses rows . 
This illustrates the type of confusion in nomenclature which is avoided by 
the convention adopted in Sec. 1.3: an operation is called a row operation 
if the underlying operation from which it is generalized is applied to the 
row vectors of the matrix, and a column operation if it is applied to columns. 

Example 1.1. A bank makes a quarterly review of accounts to produce the 
following four lists: 

1. the name, account number, and balance for each account with a balance 
less than two dollars. 

2. the name, account number, and balance for each account with a negative 
balance exceeding one hundred dollars. 

3. the name and account number of each account with a balance exceeding 
one thousand dollars. 


4. all unassigned account numbers. 
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The ledger may be described by a matrix 

( L '\ 

j m 

with column vectors L l9 L 2 , and L s representing names, account numbers, and 
balances, respectively, and with row vectors L\ L 2 , . . . , L m representing 
individual accounts. An unassigned account number is identified by the word 
“none” in the name position. The four output lists will be denoted by the 
matrices P, <?, R , and S, respectively. They can be produced by Program 1.9. 

Program 1.9. Since L 3 is the vector of balances, and 2c is a compatible vector 
each of whose components equals two, the relational statement (L 3 < 2c) defines 
a logical vector having unit components corresponding to those accounts to be 


1 

2 

3 

4 


P-(L 3 <2c)//L 
Q«-(L 3 < -100e)//L 
jR (L 3 > 1000 c)//a 2 /L 

S (Lj = none e)//c 2 /L 


L 

Bank ledger. 

L k 

A:th account. 

w 

Balance of fcth account. 

w 

Account number of Arth 
account. 

V 

Name of &th account or 
“none” if account number 
L 2 k unused. 


Legend 

Program 1.9 Selection on bank ledger L (Example 1.1) 

included in the list P. Consequently, the column compression of step 1 selects 
the appropriate rows of L to define P. Step 2 is similar, but step 3 incorporates 
an additional row compression by the compatible prefix vector a 2 = (1, 1, 0) to 
select columns one and two of L. Step 4 represents the comparison of the name 
(in column L x ) with the literal “none,” the selection of each row which shows 
agreement, and the suppression of all columns but the second. The expression 
“none c” occurring in step 4 illustrates the use of the extended definition of 
multiplication. 

Mesh, mask, and expansion 

A logical vector u and the two vectors a = u/c and b = u/c, obtained 
by compressing a vector c, collectively determine the vector c. The 
operation which specifies c as a function of a , b, and u is called a mesh and 
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is defined as follows: If a and b are arbitrary vectors and if u is a logical 
vector such that +/u = v(a) and +/u = v(b), then the mesh of a and b on 
u is denoted by \a, u , b\ and is defined as the vector c such that u/c = a 
and u/c = b. The mesh operation is equivalent to choosing successive 
components of c from a or b according as the successive components of u 
are 0 or 1. If, for example, a — (s, e, k), b = (t, a), and u = (0, 1,0, 1, 0), 
then \a,u,b\ = ( s, t, e, a, k). As a further example, Program 1.10a 



Program 1.10 Interfiling program 


(which describes the merging of the vectors a and b , with the first and every 
third component thereafter chosen from a) can be described alternatively 
as shown in Program 1.106. Since i 1 = (1, 2, 3, 4, 5, 6,. . .), then 
(3e) | 0 1 1 = (1, 2, 0, 1, 2, 0,. . .), and consequently the vector u specified by 
step 1 is of the form u = (0, 1, 1, 0, 1, 1,0,. . .). 

Mesh operations on matrices are defined analogously, row mesh and 
column mesh being denoted by single and double reverse virgules, respec¬ 
tively. 

The catenation of vectors x, y ,. . . , z is denoted by x @ y @ • • • @ z 
and is defined by the relation 

x © y © * * * © z == (*i> x 2> • • • > X v(x)> y i> • • • > z v<z))- 

Catenation is clearly associative and for two vectors x and y it is a special 
case of the mesh \x, u, y\ in which u is a suffix vector. 
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In numerical vectors (for which addition of two vectors is defined), the 
effect of the general mesh operation can be produced as the sum of two 
meshes, each involving one zero vector. Specifically, 

\*, ", y\ = 0\ + \0, t/, y\ 

= \0, 5, *\ + \0, u, y\. 

The operation \0, u, y\ proves very useful in numerical work and will be 
called expansion of the vector y, denoted by u\y. Compression of u\y by 
u and by u clearly yield y and 0, respectively. Moreover, any numerical 
vector x can be decomposed by a compatible vector u according to the 
relation 

x = u\u/x + u\u/x. 

The two terms are vectors of the same dimension which have no nonzero 
components in common. Thus if u = (1, 0, 1, 0, 1), the decomposition of 
x appears as 

x = (0, Xfy 0, x 4 , 0) + 0, # 3 , 0, # 5 ). 

Row expansion and column expansion of matrices are defined and 
denoted analogously. The decomposition relations become 

X = u\u/X + u\u/X , 

and X = u\\ul/X + u\\u//X. 

The mask operation is defined formally as follows: 

c \a , u, b/ou/c = u/a , and ujc — ujb. 

The vectors c, a, u, and b are clearly of a common dimension and c i = 
or b { according as u i = 0 or u t = 1. Moreover, the compress, expand, 
mask, and mesh operations on vectors are related as follows: 

/a, u, bI = \u/a, u, ujb\, 

\a, u, b\ = / u\a, u, u\b/. 

Analogous relations hold for the row mask and row mesh and for the 
column mask and column mesh. 

Certain selection operations are controlled by logical matrices rather 
than by logical vectors. The row compression U/A selects elements of A 
corresponding to the nonzero elements of U. Since the nonzero elements 
of U may occur in an arbitrary pattern, the result must be construed as a 
vector rather than a matrix. More precisely, U/A denotes the catenation 
of the vectors U'/A 1 ' obtained by row-by-row compression of A by U. 
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The column compression U//A denotes the catenation of the vectors Uj/Aj. 
If, for example, 

( 0 1 0 1 1 \ 

110 0 0 
0 110 0/ 

then UjA = (A 2 \ A a \ A 5 \ A*, A 2 2 , A 2 \ A,% 

and U//A = (A ± 2 , A 2 \ A 2 , A 2 *, A 3 3 , A,\ A, 1 ). 

Compression by the full matrix £ (defined by £ = 0) produces either a 
row list (E/A) or a column list ( E//A ) of the matrix A. Moreover, a numeri¬ 
cal matrix X can be represented jointly by the logical matrix U and the 
row list U/X (or the column list U//X), where U = (X ^ 0). If the matrix 
X is sparse (i.e., the components are predominantly zero), this provides a 
compact representation which may reduce the computer storage required 
for X. 

The compression operations controlled by matrices also generate a group 
of corresponding mesh and mask operations as shown in Sec. S.9. 

1.10 SELECTION VECTORS 

The logical vector u involved in selection operations may itself arise in 
various ways. It may be a prefix vector a j , a suffix c o j , or an infix (/ j a J ); 
the corresponding compressed vectors a j /x, c o j /x, and (/ j a j )/x are called 
a prefix , suffix , and infix of x, respectively. 

Certain selection vectors arise as functions of other vectors, e.g., the 
vector (x > 0) can be used to select all nonnegative components of x , and 
(b ^ *€) serves to select all components of b which are not equal to the 
literal “*.” Two further types are important: the selection of the longest 
unbroken prefix (or suffix) of a given logical vector, and the selection of the 
set of distinct components occurring in a vector. The first is useful in left 
(or right) justification or in a corresponding compression intended to 
eliminate leading or trailing “filler components” of a vector (such as left 
zeros in a number or right spaces in a short name). 

For any logical vector u , the maximum prefix of u is denoted by ol/u and 
defined as follows: 

v <— ol/uov = 

where j is the maximum value for which a/( a j ju) = 1. The maximum 
suffix is denoted by co/u and is defined analogously. If, for example, 
u = (1, 1, 1, 0, 1, 1, 0, 0, 1, 1), then ol/u = (1, 1, 1, 0, 0, 0, 0, 0, 0, 0), 
oj/u = (0, 0, 0, 0, 0, 0, 0, 0, 1, 1), +/ol/u = 3, and +/co/u = 2. 
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The leading zeros of a numerical vector x can clearly be removed either 
by compression: 

y «- («/(* = o))/*, 

or by left justification (normalization): 

2 ( + /«/(* = 0)) f *. 

The extension of the maximum prefix operation to the rows of a logical 
matrix U is denoted by ol/U and defined as the compatible logical matrix 
V, such that V i — a /U 1 . The corresponding maximum column prefix 
operation is denoted by a// U. Right justification of a numerical matrix X 
is achieved by the rotation k j X, where k = +/qj/(X = 0), and top 
justification is achieved by the rotation (+//a//(AT = 0)) {\ X (see Sec. S.6.) 

A vector whose components are all distinct will be called an ordered set. 
The forward set selector on h is a logical vector denoted by ojb and defined 
as follows: the statement v <— o/b implies that v j = 1 if and only if bj 
differs from all preceding components of b. Hence v/b is a set which 
contains all distinct components of b , and +/v/ i is a minimum. For 
example, if c = (C, a, n, a, d, a), then (cx/c)/c = (C, a, n, d) is a list of the 
distinct letters in c in order of occurrence. Clearly (o/b)/b — b if and only 
if b is a set. 

The backward set selector rjb is defined analogously (e.g., (r/c)/c = 
(C, n, d, a)). Forward and backward set selection are extended to matrices 
by both rows ( o/B , and r/B ) and columns (cr//R, and t//B) in the estab¬ 
lished manner. 


1.11 THE GENERALIZED MATRIX PRODUCT 


The ordinary matrix product of matrices X and Y is commonly denoted 
by XY and defined as follows: 


Z 


v(X) 

XYoZj = x 

k = 1 


Y ]: 

3 ’ 


/ = 1,2,...M*) 

J = U2,...v(Y). 


It can be defined alternatively as follows: 


(XY)p = +/(*' x y,). 


This formulation emphasizes the fact that matrix multiplication incorpor¬ 
ates two elementary operations (+, x) and suggests that they be displayed 
explicitly. The ordinary matrix product will therefore be written as 
X+Y. 
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More generally, if O x and 0 2 are any two operators (whose domains 
include the relevant operands), then the generalized matrix product 
X Y is defined as follows: 


For example, if 


U = 

U- \,2,...,v(Yy 


then 


A = 


A t B = 


A l B = 



and 


A _ B = 


B = 



and 


(A # 0) } B = 



The generalized matrix product and the selection operations together 
provide an elegant formulation in several established areas of mathe¬ 
matics. A few examples will be chosen from two such areas, symbolic 
logic and matrix algebra. 

In symbolic logic, De Morgan’s laws (A ju = v/w and =/u = u ) 

can be applied directly to show that 

U t V = Uy V. 


In matrix algebra, the notion of partitioning a matrix into submatrices of 
contiguous rows and columns can be generalized to an arbitrary parti¬ 
tioning specified by a logical vector u. The following easily verifiable 
identities are typical of the useful relations which result: 


X J y = (u/X) + (5//Y) + (ti/X) + x (II//F), 
u/(X J Y) = X ; ( u/Y ), 
uH(X J Y) = (U//X) J y. 

The first identity depends on the commutativity and associativity of the 
operator + and can clearly be generalized to other associative commu¬ 
tative operators, such as A, V, and 
The generalized matrix product applies directly (as does the ordinary 
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matrix product X + Y) to vectors considered as row (that is, 1 x n) or as 
column matrices. Thus: 


z+-X%\ yoz t = o 1 /(X i o 2 y), 

2 *-y I; Xoz s = Oj/'Cy o 2 X,.), 

««-y I: *<=>2 = °i /0 °2 *)• 

The question of whether a vector enters a given operation as a row 
vector or as a column vector is normally settled by the requirement of 
conformability, and no special indication is required. Thus y enters as 
a column vector in the first of the preceding group of definitions and as 
a row vector in the last two. The question remains, however, in the case 
of the two vector operands, which may be considered with the pre-operand 
either as a row (as in the scalar product y J x) or as a column. The 
latter case produces a matrix Z and will be denoted by 

Z^y 6 2 x 9 


where Z/ = y t o 2 //(Z) = v{y), and v(Z) = v(x).* For example, if 
each of the vectors indicated is of dimension three, then 


^ = 


y\> 

3>2. 


r 

y* 

yi 

3>1, 

y» 

1; 

y X e = 1 y» 


3*2 

Jx. 

3>2> 

yj 

w 


^3 


f l 1 0> 


<x 2 (3) X « (3) = | 1 1 0 

vO 0 0/ 


1.12 TRANSPOSITIONS 

Since the generalized matrix product is defined on columns of the 
post-operand and rows of the pre-operand, convenient description of 
corresponding operations on the rows of the post-operand and columns 
of the pre-operand demands the ability to transpose a matrix B , that is, to 
specify a matrix C such that Cj = In ordinary matrix algebra this 
type of transposition suffices, but in more general work transpositions 

* Since each “vector”^* 0 2 x j is of dimension one, no scan operator O x is required, 
and the symbol ° may be interpreted as a “null” scan. 
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about either diagonal and about the horizontal and the vertical are also 
useful. Each of these transpositions of a matrix B is denoted by a superior 
arrow whose inclination indicates the axis of the transposition. Thus: 


C+- 

C<- 

C<- 

C<- 


\ 

B 

/ 

B 

B 

t 

B 


C/ = B} 

— Bv+i-'i j_ / = 1,2,. 

c; = 


Cf = B 


v + l-j J 


j = 1,2,.. 


.,M*) 


-H> t 

For a vector #, either # or # will denote reversal of the order of the com- 

\ 

ponents. For ordinary matrix transposition (that is, B), the commonly 
used notation B will also be employed. 

Since transpositions can effect any one or more of three independent 
alternatives (i.e., interchange of row and column indices or reversal of 
order of row or of column indices), repeated transposition can produce 
eight distinct configurations. There are therefore seven distinct transforma¬ 
tions possible; all can be generated by any pair of transpositions having 
nonperpendicular axes.* 


1.13 SPECIAL LOGICAL MATRICES 

Certain of the special logical vectors introduced in Sec. 1.7 have useful 
analogs in logical matrices. Dimensions will again be indicated in paren¬ 
theses (with the column dimension first) and may be elided whenever the 
dimension is determined by context. If not otherwise specified, a matrix is 
assumed to be square. 

Cases of obvious interest are the full matrix E(m x n ), defined by 
E{m x n) = 0, and the identity matrix l(m x n), defined by 1/ = (/ = j). 
More generally, superdiagonal matrices k l(m x n) are defined such that 
k l/(m x n) = (j = / + k), for k > 0. Clearly °/ = /. Moreover, for 
square matrices h l J k l = ^ h + k U. 

Four triangular matrices will be defined, the geometrical symbols 
employed for each indicating the (right-angled isosceles) triangular area of 

* These transpositions generate the rotation group of the square [cf. Birkhoff and 
Mac Lane (1941) Chap. VI]. A pair of transpositions commute if and only if their axes 
are perpendicular. Hence the pair ««- and t may be written unambiguously as<$- . 
Moreover, =X • The remaining two transformations can be denoted by-^and-e^, 
with the convention that the operator nearest the operand (i.e., the horizontal) is 
executed first. 
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the m x n rectangular matrix which is occupied by ones. Thus 


C+- 

C+- 

C+- 

C+- 


□ (m x n) o C- 


□ (m x n)o C* +1 _j 

□ (m x n)oC^ +1 ~ i 


l =(i + j < min (m, n)) 


□ (m X n)oC^l_) J 


for i = 1,2,.. 
and 7 = 1,2,.. 


m 


n. 


The use of the matrices £ and / will be illustrated briefly. The relation 
u* v = 2\ 0 (u + v) can be extended to logical matrices as follows: 


utv=(2E)\ 0 (inV); 


the trace of a square numerical matrix X may be expressed as t = +///X 
The triangular matrices are employed in the succeeding section. 


1.14 POLYNOMIALS AND POSITIONAL NUMBER 
SYSTEMS 

Any positional representation of a number n in a base b number system 
can be considered as a numerical vector x whose base b value is the quantity 
n = w J x, where the weighting vector u> is defined by w = (b^ -1 , 
b v(x) ~ 2 ,. - - , b 2 , b 1 , 1). More generally, x may represent a number in a 
mixed-radix system in which the successive radices (from high to low order) 
are the successive components of a radix vector y. 

The base y value of x is a scalar denoted by y x and defined as the 
scalar product y x = w J x, where w = □ ^ y is the weighting vector. 
For example, if y = (7, 24, 60, 60) is the radix vector for the common 
temporal system of units, and if x = (0, 2, 1, 18) represents elapsed time in 
days, hours, minutes, and seconds, then 

t = w + * = (86400, 3600, 60, 1) J (0, 2, 1, 18) = 7278 

is the elapsed time in seconds, and the weighting vector w is obtained as 
the product 


1° 1 1 >] 


I 7 } 


x 1(24, 60, 60)1 


^86400\ 

0 0 11 

X 

24 


x/(60, 60) 


3600 

lo 0 0 1, 

/ 

60, 


x/(60) i 


1 60 

o 

O 

O 


w 


(x/€(0) j 

1 

1 >1 


If b is any integer, then the value of x in the fixed base b is denoted by 
(be) _L x. For example, (2e) J_ a 2 (5) = 24. More generally, if y is any real 
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number, then (ye) _L x is clearly a polynomial in y with coefficients x l9 
x 2 , ... , x v , that is, 

(ye.) ± * = x 1 y' ,{x) ~ 1 + • • • + x v _ t y + * v . 

Writing the definition of y _J_ x in the form 
y -L * = (□ * y) $x 

exhibits the fact that the operation J_ is of the double operator type. Its 
use in the generalized matrix product therefore requires no secondary scan 
operator. This will be indicated by a null placed over the symbol Thus 

Z+-XI FoZ/ = rl Yj. 

For example, (ye) ± X represents a set of polynomials in y with coeffi¬ 
cients X v X 2 , . . . , X v , and Y° L x represents a set of evaluations of the 
vector x in a set of bases Y 1 , Y 2 , . . ., Y 1 *. 

1.15 SET OPERATIONS 

In conventional treatments, such as Jacobson (1951) or Birkhoff and 
MacLane (1941), a set is defined as an unordered collection of distinct 
elements. A calculus of sets is then based on such elementary relations as 
set membership and on such elementary operations as set intersection and 
set union , none of which imply or depend on an ordering among members 
of a set. In the present context it is more fruitful to develop a calculus of 
ordered sets. 

A vector whose components are all distinct has been called (Sec. 1.10) 
an ordered set and (since no other types are to be considered) will hereafter 
be called a set. In order to provide a closed system, all of the “set opera¬ 
tions” will, in fact, be defined on vectors. However, the operations will, in 
the special case of sets, be analogous to classical set operations. The 
following vectors, the first four of which are sets, will be used for illustra¬ 
tion throughout. 

t - (t, e, a) 
a = (a, t, e) 

5 = (s, a, t, e, d) 
d = (d, u, s, k) 
n = (n, o, n, s, e, t) 
r = (r, e, d, u, n, d, a, n, t) 


A variable z is a member of a vector x if z = x l for some /. Membership 
is denoted by z e x. A vector * includes a vector y (denoted by either 
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x ^ y or y c= x) if each element 3 ^ is a member of #. If both x ^ y and 
# c= then * and y are said to be similar. Similarity of x and y is denoted 
by x = y. For example, t c s, t c r, f c a, a c t, t = a, and t^k r. If 
x ^ y and x^ky, then x is strictly included in y. Strict inclusion is denoted 
by * c 

The characteristic vector of x on y is a logical vector denoted by e y x , and 
defined as follows: 

u = e y x ov(u) = v(y ), and u j = {y j e x). 

For example, e/ - (0, 1, I, 1, 0),e/ = (1,1, l),e/ = (I, 0, 0, 0, 1), e/ = 
( 1 , 0 , 1 , 0 ), and e„' = ( 1 , 0 , 1 , 0 , 1 , 1 ). 

The intersection of y with x is denoted by y n *, and defined as follows: 

y nx = e/ly. 

For example, s n d = (s, d), d n s = ( d, s), s n r = (a, t, e, d) and 
r n s = (e, d, d, a, t). Clearly, x n y = y n x, although x n ^ is not, in 
general, equal to y n x, since the components may occur in a different 
order and may be repeated a differing number of times. The vector 
x n y is said to be ordered on x. Thus a is ordered on s. If x and y contain 
no common elements (that is, (x n y) = e( 0 )), they are said to be disjoint. 

The set difference of y and x is denoted by y A x and defined as follows: 

y A * = */ly- 

Hence y A x is obtained from y by suppressing those components which 
belong to x. For example, 1/ = (1, 0, 0, 0, 1) and s A t = (s, d). More¬ 
over, e t s = ( 0 , 0 , 0 ) and t A s = e( 0 ). 

The union of y and x is denoted by y U x and defined as follows:* 
y u x = y © (x A y). For example, s u d = (s, a, t, e, d, u, k), d u s = 
(d, u, s, k, a, t, e), s\Ja~s^Jt — s, and n u t = (n, o, n, s, e, t, a). 
In general, x u y = y U x, and x = (x n y) U (* A y). If x and y are 
disjoint, their union is equivalent to their catenation, that is, x n y = e(0) 
implies that x u y = x © y. 

In the foregoing development, the concepts of inclusion and similarity 
are equivalent to the concepts of inclusion and equality in the conventional 
treatment of (unordered) sets. The remaining definitions of intersection, 
difference, and union differ from the usual formulation in that the result of 
any of these operations on a pair of ordered sets is again an ordered set. 
With respect to similarity , these operations satisfy the same identities as do 
the analogous conventional set operations on unordered sets with respect 
to equality. 

* The symbols U and n (and the operations they denote) are commonly called cup 
and cap , respectively. 



30 The language §1.15 

The forward selection ajb and the backward selection rjb defined in 
Sec. 1.10 can both be used to reduce any vector b to a similar set, that is, 

(a/b)lb ee ( r/b)lb s b. 

Moreover, if / = (cr/tf)/*, g = (cr/y)/y, and h = ( a/z)/z , then # == y n s 
implies that / = g n h, and # = ;y u z implies that / = g u h. 

The unit vector e j (n) will be recognized as a special case of the charac¬ 
teristic vector e y x in which x consists of the single component j\ and 
y = where h is the index origin in use. In fact, the notation e{* can be 
used to make explicit the index origin h assumed for e j . 

If z is any vector of dimension two such that z 1 e x and z 2 e y , then z is 
said to belong to the Cartesian product of x and y. Thus if x = (a, b, c) 
and y = (0, 1), the rows of the matrix 



are a complete list of the vectors z belonging to the product set of x and 
y. The matrix A will be called the Cartesian product of x and y and will be 
denoted by x ® y. 

The foregoing definition by example will be formalized in a more 
general way that admits the Cartesian product of several vectors (that is, 
u 0 v ® • • • 0 y) which need not be sets, and which specifies a unique 
ordering of the rows of the resulting matrix. Consider a family of vectors 
x 1 , x 2 , . . . , X s of dimensions d l9 d 2 , . . . , d s . Then 

A *- x 1 ® x 2 <g> • • • ® x°o A 1+duk ~ e) = xl 2 ,..., xi), 

for all vectors k such that 1 < <: d t . Clearly v(A) = s , and /li(A ) = 

x \d. As illustrated by Table 1.11, the rows of the Cartesian product A are 
not distinct if any one of the vectors x j is not a set. 

If the vectors x l are all of the same dimension, they may be considered 
as the columns of a matrix X , that is, X i = x\ The product x 1 ® x 2 

® • • • ® x s = X ± ® X 2 ® • • • (x) X s may then be denoted by ®/X, or 

alternatively by ®//T, where Y is the transpose of X. For example, if 


X = i°(2) ; x €(3) = 


'0 0 0 \ 
.11 ■/' 


then ®jX is the matrix of arguments of the truth table for three variables. 
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•k 1 = (a, b, a) 

* 2 = (#, *) 

* 3 = ( 0 , 1 ) 

d = (3, 2, 2) 




# 

# 


a * 
a * 


A = 


b # 
b # 
b * 
b * 


a # 
a # 


\ a 


* 

* 


il 

1 

0 

1 

0 

1 

0 

1 

0 

1 


Table 1.11 The Cartesian product A = x 1 ® x 2 ® # 3 


1.16 RANKING 

The rank or index of an element c e b is called the b index of c and is 
defined as the smallest value of / such that c = b im To establish a closed 
system, the b index of any element a 4 b will be defined as the null character 
o. The b index of any element c will be denoted by b t c\ if necessary, the 
index origin in use will be indicated by a subscript appended to the 
operator i. Thus, if b = (a, p, e), b i 0 p = 1 , and b c x p = 2. 

The b index of a vector c is defined as follows: 

k +- b l c<=>k i = b i c*. 

The extension to matrices may be either row by row or (as indicated by a 
doubled operator symbol u) column by column, as follows: 

J^-BiCoJ 1 = BiC\ 

K <— B uC o Kj = B j i C 3 .. 

Use of the ranking operator in a matrix product requires no secondary 
scan and is therefore indicated by a superior null symbol. Moreover, 
since the result must be limited to a two-dimensional array (matrix), either 
the pre- or post-operand is required to be a vector. Hence 

j+.B^coJ 1 = B* i c, 

K^-b^CoK j = bi C,. 

The first of these ranks the components of c with respect to each of a set 
of vectors B 1 , J3 2 , . . . , B /l , whereas the second ranks each of the vectors 
C v C 2 ,. . . , C v with respect to the fixed vector b. 
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The use of the ranking operation can be illustrated as follows. Consider 
the vector b = (a, b, c, d, e) and the set of all 3 5 three-letter sequences 
(vectors) formed from its components. If the set is ordered lexically, and 
if x is the zth member of the set (counting from zero), then 

j = (v(b)e) ± (b i 0 x). 

For example, if x = (c, a, b), then (b i 0 x) = ( 2 , 0, 1), and j — 51. 


1.17 MAPPING AND PERMUTATIONS 


Reordering operations 

The selection operations employed thus far do not permit convenient 
reorderings of the components. This is provided by the mapping operation 
defined as follows:* 

c+-a k oc { = a ki . 

For example, if a = (a, b, . . . , z) and k = ( 6 , 5, 4), then c = (f, e, d). 

The foregoing definition is meaningful only if the components of k each 
lie in the range of the indices of a, and it will be extended by defining a 5 as 
the null element o if j does not belong to the index set of a. Formally, 


= 


if m i e 

if m i 4 \}(v{a)). 


The ability to specify an arbitrary index origin for the vector a being 
mapped is provided by the following alternative notation for mapping: 


a m . if m i e i 5 (v(o)) 

c m aoc i = 

(o if m, 4 i 3 (v(a)), 

where y-origin indexing is assumed for the vector a. For example, if a 
is the alphabet and m = (5, o, o, 4, 27, o, 3), then c = m J 0 a = (f, o 5 o, e, 
o, o 5 d), and (c ^ o e)/c = (f, e, d). Moreover, m| 2 a = (d, o, o, c, z, o, b). 
Elision of j is permitted. 

If a c h, and m = b i 5 a, then clearly mj j b = a. If a £ b , then 
m b contains (in addition to certain nulls) those components common to 
b and a, arranged in the order in which they occur in a . In other words, 

(m ^ °€)/(m Sj b) = a n b. 


* For the purposes of describing algorithms, this notation is superior to the classical 
“disjoint cycles” notation for permutations [cf. Birkhoff and MacLane, (1941)] because 
(1) the direction of the transformation (from a to c) is unequivocally indicated, and (2) 
the notation directly indicates a straightforward and efficient method for actual execu¬ 
tion, namely, indirect addressing. 
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Consequently, if p, q,... 9 1 are vectors, each contained in h, then each 
can be represented jointly by the vector b and a mapping vector. If, for 
example, b is a glossary and p, q, etc., are texts, the total storage required 
for b and the mapping vectors might be considerably less than for the 
entire set of texts. 

Mapping may be shown to be associative, that is, m 1 J i (m 2 J y a) = 
(m 1 m 2 )J y a. Mapping is not, in general, commutative. 

Mapping is extended to matrices as follows: 

A^M^BoA* = M'$ h B\ 

C ■*- M^ h BoCj — Mj h Bj. 

Row and column mappings are associative. A row mapping 1 M and a 
column mapping 2 M do not, in general, commute, but do if all rows of 
1 M agree (that is, 1 M = € ° p), and if all columns of 2 M agree (that is, 
2 M = q o c). The generalized matrix product is defined for the cases 

m J A, and M J a. 

The alternative notation (that is, c = <* m ), which does not incorporate 
specification of the index origin, is particularly convenient for matrices and 
is extended as follows: 

A B m o A i = , 

A ^B m oA { = B m , 

Permutations 

A vector k of dimension n is called a y-origin permutation vector if 
k = i \n). A permutation vector used to map any set of the same dimen¬ 
sion produces a reordering of the set without either repetition or suppres¬ 
sion of elements, that is, k^a — a for any set a of dimension v(k). For 
example, if a = (f, 4, *, 6, z), and k = (4, 2, 5, 1, 3), then k Ji a = (6, 4, 
z, f, *). 

If p is an h -origin permutation vector and q is any y-origin permutation 
vector of the same dimension, then q p is an /^-origin permutation vector. 
Since 

I'M 0 ))!, a = a, 

the interval vector i j (n) will alsa be called the j-origin identity permutation 
vector. If p and q are two y-origin permutation vectors of the same 
dimension n and if q$jp = i j (n), then p J, q = i ?(n) also andp and q are 
said to be inverse permutations. If p is any y-origin permutation vector, 
then q = p ij i j is inverse to p. 

The rotation operation k \ x is a special case of permutation. 
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Function mapping 

A function / which defines for each element of a set b a unique 
correspondent a k in a set a is called a mapping from b to a . If f(b= a k , 
the element b i is said to map into the element a k . If the elements /( b t .) 
exhaust the set a, the function/is said to map b onto a. If b maps onto a 
and the elements f(b t ) are all distinct, the mapping is said to be one-to-one 
or biunique. In this case, v(a) = v(b), and there exists an inverse mapping 
from a to b with the same correspondences. 

A program for performing the mapping/from b to a must therefore 
determine for any given element b € b, the correspondent a e a, such that 
a = f(b). Because of the convenience of operating upon integers (e.g., 
upon register addresses or other numeric symbols) in the automatic 
execution of programs, the mapping is frequently performed in three 
successive phases, determining in turn the following quantities: 

1. the index i = b i b, 

2. the index k such that a k = /(&■), 

3. the element a k . 

The three phases are shown in detail in Program 1.12a. The ranking is 
performed (steps 1-3) by scanning the set b in order and comparing each 
element with the argument b. The second phase is a permutation of the 
integers 1 , 2 ,..., v(b ), which may be described by a permutation vector 
/, such that / = k. The selection of j i (step 4) then defines k , which, in 
turn, determines the selection of a k on step 5. 

Example 1.2. If 

b = (apple, booty, dust, eye, night), 
a = (Apfel, Auge, Beute, Nacht, Staub) 

are, respectively, a set of English words and a set of German correspondents (both 
in alphabetical order), and if the function required is the mapping of a given 
English word b into its German equivalent a according to the dictionary corre¬ 
spondences : 

English: apple booty dust eye night 

German: Apfel Beute Staub Auge Nacht 

then j = (1, 3, 5, 2, 4). If b = “night,” then / = 5,/ = 4, and a = a 4 = Nacht. 

If k is a permutation vector inverse to/, then Program 1.126 describes a 
mapping inverse to that of Program 1.12a. If / = (1,3, 5, 2, 4), then 
k = (1, 4, 2, 5, 3). The inverse mapping can also be described in terms of 
/, as is done in Program 1.12c. The selection of the ith component of the 
permutation vector is then necessarily replaced by a scan of its components. 
Programs 1.12c/ and 1.12c show alternative formulations of Program 1.12a. 





(e) b t <=> a Ji 


a 

Set of correspondents in 
Programs (a, d, e) and set 
of arguments in Programs 
(b, c). 

b 

Set of arguments in Pro¬ 
grams (a, d, e) and set 
of correspondents in Pro¬ 
grams (b, c). 

J,k 

Mutually inverse 
permutation vectors. 


Legend 


Program 1.12 Mapping defined by a permutation vector./ 
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Ordering vector 

If x is a numeric vector and k is ay-origin permutation vector such that 
the components of y = k J jf x are in ascending order, then k is said to order 
x. The vector k can be determined by an ordering operation defined as 
follows: 

k 0JX 

implies that k is a j- origin permutation vector, and that if y — k x, then 
either y { < y i+1 or y i = y i+1 and k i < k i+1 . The resulting vector k is 
unique and preserves the original relative order among equal components. 
For example, if x == (7, 3, 5, 3), then djx = (2, 4, 3, 1). 

The ordering operation is extended to arbitrary vectors by treating all 
nonnumeric quantities as equal and as greater than any numeric quantity. 
For example, if a = (7, °, 3,,° 5, 3), then Oja = (3, 6, 5, 1, 2, 4), and if 
b is any vector with no numerical components, then djb = i j (v(b)). 

Ordering of a vector a with respect to a vector b is achieved by ordering 
the fr-index of a. For example, if a — (e, a, s, t, 4, 7, t, h), and b is the 
alphabet, then m — b t x a = ( 5, 1, 19, 20, °, °, 20, 8) and O^m = (2, 1, 8, 
3, 4, 7, 5, 6). 

The ordering operation is extended to matrices by the usual convention. 
If K = Ojl/A, then each column of the matrix B = K JJ ; A is in ascending 
order. 


1.18 MAXIMIZATION 

In determining the maximum m over components of a numerical vector 
x , it is often necessary to determine the indices of the maximum components 
as well. The maximization operator is therefore defined so as to determine 
a logical vector v such that v/x = me. 

Maximization over the entire vector x is denoted by e\x , and is defined as 
follows: if v — e\x, then there exists a quantity m such that v/x = me and 
such that all components of v/x are strictly less than m. The maximum is 
assumed by a single component of x if and only if +/v = 1. The actual 
value of the maximum is given by the first (or any) component of v/x. 
Moreover, the /-origin indices of the maximum components are the 
components of the vector v/i j . 

More generally, the maximization operation v u\x will be defined so 
as to determine the maximum over the subvector u\x only, but to express 
the result v with respect to the entire vector x. More precisely, 

v u\xov = u\(e\(u/x)). 

The operation may be visualized as follows—a horizontal plane punched 
at points corresponding to the zeros of u is lowered over a plot of the 
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components of x , and the positions at which the plane first touches them 
are the positions of the unit components of v. For example, maximization 
over the negative components of x is denoted by 

v <- (x < 0)[* 

and if * = (2, -3, 7, -5, 4, -3, 6), then (x < 0) = (0, 1, 0, 1, 0, 1, 0), 
v = (0, 1, 0, 0, 0, 1, 0), v/x = (-3, -3), (»/*)! = -3, and, vji 1 = (2, 6). 
Minimization is defined analogously and is denoted by u[x. 

The extension of maximization and minimization to arbitrary vectors is 
the same as for the ordering operation, i.e., all nonnumeric quantities are 
treated as equal and as exceeding all numeric quantities. The extensions 
to matrices are denoted and defined as follows: 

V<- U\XoV i = U*\X\ 

V *- U\\XoV j = u^x,, 

V*-U\ xoV * = IT]*, 

V<- u\ Xo V i = u\X j . 

As in the case of the ordering operation, maximization in a vector a with 
respect to order in a set b is achieved by maximizing over the b-index of a. 
Thus if 

dchdhshdchchd 

a6kq435k82j92 
represents a hand of thirteen playing cards, and if 

/c, d, h, S, 0, o, o, o, o, o, o, o, o\ 

\2, 3, 4, 5, 6, 7, 8, 9, 10, j, q, k, a/ 

/ 1 , 0 , 2 , 1 , 2 , 3 , 2 , 1 , 0 , 2 , 0 , 2 , 1 

then B t H = 

\12, 4, 11,10, 2, 1,3, 11, 6, 0,9, 7,0 
(4,13) 1 (B t 0 H) = (25, 4, 37, 23, 28, 40, 29, 24, 6, 26, 9, 33, 13), 
and ( e l((4, 13) ° (B t 0 H)))/H = (s, 3) 

is the highest ranking card in the hand. 

1.19 INVERSE FUNCTIONS 

To every biunique* function/ there corresponds an inverse function g 
such that g{f(x)) = x for each argument x in the domain of the function f. 

* If the function f is many-to-one, the specification of a unique inverse^ is achieved 
by restricting the range of g to some set of “pri nc 'P a r values, as is done, for example, 
for the inverse trigonometric functions. 
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It is common practice either to introduce a distinct symbolism for the 
inverse function, as for the inverse functions of logarithm (log b x) and 
exponentiation (6 3 ), or to use a superscript — 1, as in sin -1 a: or f~\x). 

The first alternative doubles the number of distinct operator symbols 
required and obscures the relation between pairs of inverse functions; 
the second raises other difficulties. The solution adopted here is that of 
implicit specification; i.e., a statement is permitted to specify not only a 
variable but also any function of that variable. Functions may therefore 
appear on both sides of the specification arrow in a statement. For 
example, 

(2c) J _ x <—z 

specifies the variable x as the vector whose base two value is the number z. 

Certain ambiguities remain in the foregoing statement. First, the 
dimension of x is not specified. For example, if z = 12, x = (1, 1, 0, 0) is 
an admissible solution, but so are (0, 1, 1,0, 0) and (0, 0, 0, 1, 1,0, 0). 
This could be clarified by compatibility with a specified dimension of c. 
Thus the statement 

(2«(5)) J_ x t-z 

specifies x unambiguously as (0, 1, 1, 0, 0). More generally, however, any 
previously specified auxiliary variables will be listed to the right of the 
main statement, with a semicolon serving as a separation symbol. The 
current example could therefore be written as 

v(x) 5 

(2c) _L x<-z; v(x). 

The second ambiguity concerns the permissible range of the individual 
components of x. For example, the base two value of x = (5, 2) is also 
twelve. For certain functions it is therefore necessary to adopt some 
obvious conventions concerning the range of the result. The assumption 
implicit in the preceding paragraph is that each component of x is limited 
to the range of the residues modulo the corresponding radix. This con¬ 
vention will be adopted. Hence the pair of statements 

y - (7, 24, 60, 60) 

7278; y 

determines x unambiguously as the vector (0, 2, 1, 18). 

It is also convenient, though not essential, to use selection operations on 
the left of a statement. Thus the statement 

ujb <— a 
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is understood to respecify only the selected components of b and to leave 
all others unchanged. It is therefore equivalent to the statement 

b «- \u/b, u, a\. 


Similarly, 


u/b uja 


is equivalent to 


b «-/*>, u, a/. 


1.20 LEVELS OF STRUCTURE 

Vectors and matrices are arrays which exhibit one level and two levels of 
structure, respectively. Although in certain fields, such as tensor analysis, 
it is convenient to define more general arrays whose rank specifies the 
number of levels of structure (i.e., zero for a scalar, one for a vector of 
scalars, two for a vector of vectors (matrix), three for a vector of matrices, 
etc.), the notation will here be limited to the two levels provided by the 
matrix.* The present section will, however, indicate methods for removing 
this limitation. 

The only essential particularization to two levels occurs in the provision 
of single and double symbols (e.g., “/” and “_L” and “_[]_”) for row 
and column operations, respectively, and in the use of superscripts and 
subscripts for denoting rows and columns, respectively. In applications 
requiring multiple levels, the former can be generalized by adjoining to the 
single symbol an index which specifies the coordinate (e.g., “/i” and “/ 2 ’\ 
for row and for column compression, and, in general, “//’.) The latter can 
be generalized by using a vector index subscript possessing one component 
index for each coordinate. 

The generalized notation can be made compatible with the present 
notation for vectors and matrices by adopting the name tensor and a 
symbol class (such as capital italics) for the general array of arbitrary rank. 

1.21 SUBROUTINES 

Detail can be subordinated in a more general manner by the use of 
subroutines. The name of one program appearing as a single statement in 
a second program implies execution of the named program at that point; 
the named program is called a subroutine of the second program. If, for 
example, “Cos” is the name of a program which specifies z as the cosine of 

* Further levels can, of course, be handled by considering a family of matrices 1 M, 
2 M,. .., n M, or families of families /M. 
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the angle between the vectors x and y, then Program 1.13 a uses the pro¬ 
gram “Cos” as a subroutine to determine r as the cosine of the angle 
between the vectors p and q. 



(a) (6) (c) 

Program 1.13 Modes of subroutine reference 

It is sometimes convenient to include the names of the arguments or 
results or both in the name of the subroutine as dummy variables. Thus if 
“Cos (x, y )” is the name of a subroutine which determines 2 as the cosine 
of the angle between x and y, then Program 1.136 uses Cos (x, y) as a 
subroutine to determine r as the cosine of the angle between p and q. 
Similarly, the program “z <— Cos (x, y)” can be used as in Program 1.13c 
to produce the same result. 

1.22 FILES 

Many devices used for the storage of information impose certain restric¬ 
tions upon its insertion or withdrawal. The items recorded on a magnetic 
tape, for example, may be read from the tape much more quickly in the 
order in which they appear physically on the tape than in some other 
prescribed order. 

Certain storage devices are also self-indexing in the sense that the item 
selected in the next read from the device will be determined by the current 
state or position of the device. The next item read from a magnetic tape, 
for example, is determined by the position in which the tape was left by the 
last preceding read operation. 

To allow the convenient description of algorithms constrained by the 
characteristics of storage devices, the following notation will be adopted. 
A file is a representation of a vector x arranged as follows: 

Pv X lt P2> X 2> • • • 9 X v(x)> Pv(x) 4 1> °» Pv{i c)+2> °» * * * 9 Pv(py 
The null elements denote the “unused” portion of the file not employed in 
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representing x. Each partition p, determines a position (position j) in the 
file. If a file O is in position j, then a forward read, denoted by 

x,P+- 0 $, 

specifies x by the component x j9 the auxiliary variable p by the succeeding 
partition p m , and stops the file in the position j + 1. 

The position of a file O will be denoted by 77(0). Thus the statement 
j 77(0) specifies j as the position of O, whereas 77(0) j positions the 
file to j. In particular, 7t(0) 1 denotes the rewinding of the file and 

77 ( 0 ) v denotes winding , i.e., positioning to the extreme end of the file. 
Any file for which the general positioning operation 7r(0) <- j is to be 
avoided as impossible or inefficient is called a serial or serial-access file. 

Each terminal partition (that is, p x and p„ (p) ) assumes a single fixed 
value denoted by A. Each nonterminal partition p, may assume one of 
several values denoted by X 2 , . . . , X r( ^ )? the partitions with larger 
indices normally demarking larger subgroups of components within the 
file. Thus if x were the row list of a matrix, the last component might be 
followed by the partition X 3 , the last component of each of the preceding 
rows by X 2 , and the remaining components by The auxiliary variable p 
specified by the partition symbol during the read of a file is normally used 
to control a subsequent branch. 

A file may be produced by a sequence of forward record statements: 

0 f I> *— x,,p for i € i\v(x)), 

where p is the partition symbol recorded after the component x jt As in 
reading, each forward record operation increments the position of the file 
by one. A file which is only recorded during a process is called an output 
file of the process; a file which is only read is called an input file. 

Different files occurring in a process will be distinguished by righthand 
subscripts and superscripts, the latter being usually employed to denote 
major classes of files, such as input and output. 

Example 1.3. A set of m input files O* 1 , / e i\m), each terminated by a partition 
X 2 , is to be copied to a single output file O x 2 as follows. Successive items (com¬ 
ponents) are chosen in turn from files Oj 1 , <1,. .., Oj 1 , Og 1 , . . . , always 
omitting from the sequence any exhausted file. A partition X 2 is to be recorded 
with the last item recorded on <D 1 2 , and all files are to be rewound. The process is 
described by Program 1.14. 

Program 1.14. Step 8 cycles k through the values 1 to w, and step 9 allows the 
read on step 10 to occur only if u k = 0. The logical vector u is of dimension m 
and designates the set of exhausted files. Its A;th component is set to unity by 
step 11 when file k is exhausted, as indicated by the occurrence of the partition X 2 . 
Each read is normally followed by step 13, which records on the output file the 
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Program 1.14 Program for Example 1.3 


item read. However, when the last file becomes exhausted, step 14 is executed 
instead to record the last item, together with the final partition X 2 . 

Steps 1-6 initialize the parameters u and k and rewind all files. After the last 
item is recorded by step 14, the file rewinds are repeated before the final termina¬ 
tion on step 7. 

It is sometimes convenient to suppress explicit reference to the partition 
symbol read from a file by using a statement of the form 



where the indicated branches depend on the value of the partition p j+1 
which terminates the read. Thus the left or the right branch is taken 
according to whether p j+1 = \ or p j+1 = X 2 . Certain files (such as the 
IBM 7090 tape files) permit only such “immediate” branching and do not 
permit the partition symbol to be stored for use in later operations, as was 
done in Program 1.14. 

In recording, the lowest level partition X x may be elided. Thus statement 
13 of Program 1.14 may be written as 

O, 2 b. 
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A file may be read or recorded backward as well as forward. A backward 
read is denoted by 

x,p<- !<!>, 

and if ® is initially in position j + 1, then x — x j9 p = p j9 and the final 
position becomes j. Backward recording is defined analogously. The 
zero prescript may be omitted from the symbol 0 O for both forward 
reading and recording. 

The conventions used for matrices can be applied in an obvious way to 
an array of files O/. For example, the statement 

7T(0*) € 

denotes the rewinding of the row of files O/, j e i 1 (r(d>)); the statement 

77 -( 0 ; ) € 

denotes the rewinding of the column of files O/, / e ^(//(O)); and the 
statement 

t//O z ' +- u/x 9 ii/p 

denotes the recording of the vector component x j on file O/ together with 
partition pj for all j such that = 1. 

As for vectors and matrices, y-origin indexing may be used and will 
apply to the indexing of the file positions and the partition vector X as well 
as to the array indices. However, the prescripts (denoting direction of 
read and record) are independent of index origin. O-origin indexing is 
used in the following example. 

Example 1.4. Files <D 0 ° and Oj 0 contain the vectors x andjy, respectively, each 
of dimension n. In the first phase, the components are to be merged in the order 
# 0 , x i> JVi» • • • , yv-h an d the first n components of the resulting vector 
are to be recorded on file Oq 1 , and the last n on file 0 1 1 . In other words, the 
vectors x 1 = a 71 /z, andjy 1 = cj n /z are to be recorded on Oq 1 and Oj 1 , respectively, 
where z = \x 9 u 9 y\ , and u = (0, 1, 0, 1, ... , 0, 1). In the next phase, the roles 
of input and output files are reversed, and the same process is performed on x 1 
and y l 9 that is, x 2 = a n l(\x 1 9 u , and y 2 = c o n l(\x 1 , u, jv J \) are recorded on 
files O 0 ° and O! 0 , respectively. The process is to be continued through m phases. 

Program 1.15. The program for Example 1.4 begins with the rewind of the 
entire 2x2 array of files. To obviate further rewinding, the second (and each 
subsequent even-numbered) execution is performed by reading and recording all 
files in the backward direction. Step 6 performs the essential read and record 
operation under control of the logical vector u, whose components u 0 , u l9 u 2 
determine, respectively, the subscript of the file to be read, the subscript of the 
file to be recorded, and the direction of read and record. The file superscripts 
(determining which classes serve as input and output in the current repetition) are 
also determined by u 2 , the input being u 2 and the output u 2 . The loop 6-8 copies 
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0-origin indexing 

0 

File array of dimension 

2 x 2; original input 0°; 
original output O 1 . 

u 

Control vector. 

u o 

Column index of input 
file. 

U 1 

Column index of output 
file. 

U 2 

Row index of current 
input file, and direction 
of read and record. 

n 

Number of items per 
file. 

m 

Required number of 
merges. 



Legend 

Program 1.15 Program for Example 1.4 

n items, alternating the input files through the negation of u 0 on step 7. When 
the loop terminates, is negated to interchange the outputs, and the loop is 
repeated unless — u 2 . Equality occurs and causes a branch to step 3 if and 
only if all 2 n items of the current phase have already been copied. 

Step 3 decrements m and is followed by the negation of u on step 4. The com¬ 
ponent u 2 must, of course, be negated to reverse direction, but the need to negate 
u 0 and u x is not so evident. It arises because the copying order was prescribed for 
the forward direction, beginning always with the operation 

0<V O^V* 

An equivalent backward copy must therefore begin with the operation 

d) p d) 7> 

1^1 1 1 1 • 

Not all computer files have the very general capabilities indicated by the 
present notation. Some files, for example, can be read and recorded in the 
forward direction only and, except for rewind, cannot be positioned 
directly. Positioning to an arbitrary position k must then be performed by 
a rewind and a succession of (ik — 1) subsequent reads. In some files, re¬ 
cording can be performed in the forward direction only, and the positions 
are defined only by the recorded data. Consequently, recording in posi¬ 
tion k makes unreliable the data in all subsequent positions, and recording 
must always proceed through all successive positions until terminated. 
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1.23 ORDERED TREES 
Directed graphs 

For many processes it is convenient to use a structured operand with the 
treelike structure suggested by Fig. 1.16. It is helpful to begin with a more 




Figure 1.16 A general triply rooted tree with A(T) = 16, v(T) = (3, 3, 4, 3, 2), 
v(T) = 5, *i(T) = (3, 7, 8 , 5, 3), and ^(T) = 26 
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general structure (such as Fig. 1.17) in which a unidirectional association 
may be specified between any pair of its components. 

A directed graph comprises a vector n and an arbitrary set of unilateral 
associations specified between pairs of its components. The vector n is 
called a node vector and its components are also called nodes . The associa¬ 
tions are conveniently specified by a (logical) connection matrix U of 
dimension v(n) x v(n) with the following convention: there is an associa¬ 
tion, called a branch , from node i to node j if and only if Uf = 1. 

A directed graph admits of a simple graphical interpretation, as 
illustrated by Fig. 1.17. The nodes might, for example, represent places, 
and the lines, connecting streets. A two-way street is then represented by 
a pair of oppositely directed lines, as shown between nodes 3 and 4. 

If k is any mapping vector such that 

I/!**-' = 1 for i = 2, 3,..., v(k), 

then the vector p = fcjn is called a path vector of the graph (n, U ). The 
dimension of a path vector is also called its length. Nodes k x and k v are 
called the initial and final nodes, respectively; both are also called 
terminal nodes. If j is any infix of k, then q = /Jn is also a path. It is 
called a subpath of p and is said to be contained in p. If v(q) < v(p), then 
q is a proper subpath of p. If k x — k v and p = fcjVz is a path of a length 
exceeding one, p is called a circuit. For example, if k = (6, 1, 7, 7, 2, 6, 
1, 5 ), then p = ( n 6 , n v n 7 , n 7 , n 2 , n 6 , n v n 5 ) is a path vector of the graph 
of Fig. 1.17, which contains the proper subpaths (n 7 , n 2 , n 6 ), (n l9 n 7 , 
n 7 , n 2 > n 6> n i)> an d ( n 7 > n T)> the last two of which are circuits. Node j 
is said to be reachable from node i if there exists a path from node i to 
node j. 

Ordered trees 

A graph (such as Fig. 1.16) which contains no circuits and which has 
at most one branch entering each node is called a tree . Since each node 
is entered by at most one branch, a path existing between any two nodes 
in a tree is unique, and the length of path is also unique. Moreover, 
if any two paths have the same final node, one is a subpath of the 
other. 

Since a tree contains no circuits, the length of path in a finite tree is 
bounded. There therefore exist maximal paths which are proper subpaths 
of no longer paths. The initial and final nodes of a maximal path are 
called a root and leaf of the tree, respectively. A root is said to lie on the 
first level of the tree, and, in general, a node which lies at the end of a path 
of length j from a root, lies in the y'th level of the tree. 
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Figure 1.17 A graphical representation of the directed graph (n, 17). 

A tree which contains n roots is said to be n-tuply rooted . The sets of 
nodes reachable from each of the several roots are disjoint, for if any 
node is reachable by paths from each of two disjoint roots, one is a proper 
subpath of the other and is therefore not maximal. Similarly, any *node 
of a tree defines a subtree of which it is the root, consisting of itself and 
all nodes reachable from it, with the same associations as the parent 
tree. 

If for each level y, a simple ordering is assigned to each of the disjoint 
sets of nodes reachable from each node of the preceding level, and if the 
roots are also simply ordered, the tree is said to be ordered. Attention will 
henceforth be restricted to ordered trees, which will be denoted by upper¬ 
case boldface roman characters. The height of a tree T is defined as the 
length of the longest path in T and is denoted by ^(T). The number of 
nodes on level j is called the moment of level j and is denoted by (x ; (T). The 
vector p.(T) is called the moment vector. The total number of nodes in T is 
called the moment of T and is denoted by ju(T). Clearly, K(*(T)) = HT), 
and +/p.(T) = //(T) = v{ri). The number of roots is equal to (x^T), and 
the number of leaves will be denoted by A(T). 

The number of branches leaving a node is called its branching ratio or 
degree , and the maximum degree occurring in a tree T is denoted by <5(T). 
The dispersion vector of a tree T is denoted by v(T) and is defined as 
follows: v x (T) = ^(T), and for j = 2, 3,..., v(T), v,(T) is equal to the 
maximum over the branching ratios of the nodes on level j — 1. For the 
tree of Fig. 1.16, v(T) = (3, 3, 4, 3, 2). The number of roots possessed by 
a tree T (that is, v^T)) is called its dispersion. A tree possessing unity 
dispersion is called rooted or singular. 



U = 


0 0 0 0 1 0 
0 0 0 0 0 1 
0 0 0 1 1 1 
0 0 1 0 0 0 
0 0 0 0 0 1 
1 0 0 0 0 0 
\o 1 0 0 0 0 
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Each node n* of a graph (and hence of a tree) may be identified by its 
index i. Since a tree admits of more convenient index vectors, the under¬ 
lying index i will henceforth be referred to as the graph index. 

In an ordered tree, any path of length k from a root can be uniquely 
specified by an index vector i of dimension k , where i x specifies the partic¬ 
ular root, and the remaining components specify the (unique) path as 
follows: the path node on level j is the i i r th element of the set of nodes on 
level j reachable from the path node on level y-1. The node at the end of 
the path can therefore be designated uniquely by the index vector i. The 
degree of node i will be denoted by d (f, T). The index vectors are shown 
to the left of each node in Fig. 1.16. 

The path from a root whose terminal node is i will be denoted by T*. 
In Fig. 1.16, for example, T* = (n 2 , n 8 , n 13 , n 24 ) if i = (2, 2, 2, 3). A 
vector i is said to be an index of T if it is the index of some node in T. 

The subtree of T rooted in node i will be denoted by T t *. Thus in Fig. 
1.16, P = T (2j2 , 2 ) is a rooted subtree with v(P) = (1,3, 2), and p.(P) = 
(1, 3, 3). A path in T, is denoted by (T f ) y . For example, if G is an 
ascending genealogical tree* with the sword and distaff sides denoted by 
the indices 1 and 2, respectively, then any individual x and the nearest 
(n — 1) paternal male ancestors are represented by the path vector (G,) €(w) , 
where i is the index of x in G. 

Example 1.5. Determine the index i such that the path T* is equal to a given 
argument x and is the “first” such path in T; that is, the function 

(a'(*>MT)) _L i 

is a minimum. 

Program 1.18. The index vector i specifies the path currently under test. Its 
last component is incremented repeatedly by step 7 until the loop 6-8 is terminated. 
If the path T* agrees with the corresponding prefix of the argument x, termination 
occurs through the branch to step 9, which tests for completion before step 10 
augments i by a final zero component. Step 5 then respecifies d as the degree of 
the penultimate node of the set of d paths next to be tested by the loop. Termina¬ 
tion by a branch from step 6 to step 2 occurs if all d possible paths are exhausted 
without finding agreement on step 8. In this event, retraction by one level occurs 
on step 2, and d is again respecified. If v(i) = 1, the paths to be searched com¬ 
prise the roots of the tree and d must therefore be specified as the number of 
roots. This is achieved by executing step 3 and skipping step 5. Retraction to a 
vector i of dimension zero occurs only if ail roots have been exhausted, and final 
termination from step 4 indicates that the tree possesses no path equal to the 
argument x. 

* Although such a genealogical tree is not necessarily a tree in the mathematical 
sense, it will be assumed so for present purposes. 
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Program 1.18 Determination of i such that T* = x 


If d is a vector of dimension v(n) such that d { is the degree of node n i of 
a tree T, then d is called the degree vector associated with n. In Fig. 1.16, 
for example, 

d = (3,2, 4, 0,0, 0,2,..., 1,0,0). 

Moreover, if n is itself the alphabet (that is, n = (a, b, c,. .., z)), then 
the vector ri of Table 1.19a is a permutation of n , and d' is the associated 
degree vector. Table 1.19b shows another such pair, n" and d". 

The degree vector provides certain useful information most directly. 
For example, since each leaf is of degree zero, A(T) = +/(d = 0). More¬ 
over, the number of roots is equal to the number of nodes less the total of 

the degrees, that is, p-^T) = v(d )- \-jd , and the maximum degree 

occurring in T is given by <5(T) = ((cf d)jd) v Finally, the degree vector 
and the node vector together can, in certain permutations (those of Table 
1.19), provide a complete and compact description of the tree. 

Right and left list matrices 

If each one of the /u( T) index vectors i of a tree T is listed together with 
its associated node (T i ) v(i) , the list determines the tree completely. Since 
the index vectors are, in general, of different dimensions, it is convenient 
to append null components* to extend each to the common maximum 
dimension v(T). They may then be combined in an index matrix of 

* In the 1-origin indexing system used here it would be possible to use the numeric 
zero to represent the null. In 0-origin indexing, however, zeros occur as components of 
index vectors and must be distinguishable from the nulls used. 
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Full left list matrix [T Full right list matrix ]T 

(a) (b) 

Table 1.19 Full list matrices of the tree of Fig. 1.16 


dimension //(T) x v(T), which, together with the associated node vector, 
completely describes the tree, If, for example, the node vector n is the 
alphabet, the tree of Fig. 1.16 is described by the node vector ri and index 
matrix /' of Table 1.19a or, alternatively, by n" and I" of Table 1.196. 

Because of the utility of the degree vector, it will be annexed to the array 
of node vector and index matrix, as shown in Table 1.19a, to form a full 
list matrix of the tree. The degree vector and node vector together will be 
called a list matrix. As remarked, the list matrix can, in certain permuta¬ 
tions, alone describe the tree. 

Formally, the full list matrix M of a tree T is defined as follows: a 2 /M 
is an index matrix of the tree, M x is the associated degree vector, and M 2 
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is the associated node vector. Thus for each k e ^(//(T)), Mf = 6 (i, T), 
and Mf = (T*) v(i) , where i is the nonnull portion of a 2 /M k , that is, i = 
((a 2 /M k ) ^ o£)/a 2 /M fc ). The corresponding list matrix is a 2 /M. 

Since a full list matrix provides a complete description of a tree regard¬ 
less of the order in which the nodes occur in the list, any column permu¬ 
tation M p (that is, any reordering among the rows) is also a full list matrix. 
Two particular arrangements of the full list matrix are of prime interest 
because each possesses the following properties: (1) the nodes are grouped 
in useful ways, and (2) the list matrix (i.e., the degree vector and node 
vector) alone describes the tree without reference to the associated index 
matrix. They are called the full left list matrix and full right list matrix and 
are denoted by [T and ]T, respectively. Table 1.19 shows the full left and 
full right lists of the tree of Fig. 1.16. 

The left list index matrix I is left justified,* that is, the null elements are 
appended at the right of each index. The rows V are arranged in increasing 
order on their values as decimal (or rather (d(T) + l)-ary) fractions with 
the radix point at the left and the nulls replaced by zeros. More precisely, 
the rows are arranged in increasing order on the function (v(a)e) _]_ 
(a i 0 T), where a = ( o, 1, 2, .. ., <$(T)).t 

The right list matrix is right justified and is ordered on the same function, 
namely (v(a)e) J_ (a t Q I j ). The rows are therefore ordered on their values 
as integers, i.e., with the decimal point at the right. From the example of 
Table 1.1 9b it is clear that the right list groups the nodes by levels, i.e., 
level j is represented by the infix (/ [ a /J )//(]T), where k = fi^(T), and 
i = In Table 1.1 9b, for example, p.(T) = (3, 7, 8, 5, 3), and 

if j = 3, then k = 8, i = 10, and level j is represented by rows i + 1 = 11 
to i -f k = 18. The right list is therefore useful in executing processes 
(such as the /?th degree selection sort) which require a scan of successive 
levels of the tree. 

The left list groups the nodes by subtrees, i.e., any node i is followed 
immediately by the remaining nodes of its subtree T ( . Formally, if I = 
a 2 /[T, and if i = (I k ^ °€)/P, then the tree T f is represented by the infix 
((A; — 1) l In Fig. 1.19a, for example, if k = 16, then i = 

(2, 2, 2), //(T f ) = 7, and T f is represented by rows 16 to 22 of [T. The left 
list is therefore useful in processes (such as the construction of a Huffman 
code and the evaluation of a compound statement) which require a 
treatment of successive subtrees. 

The row index of a node in a right (left) fist matrix is a graph index of 
the node and will be called the right (left) list index . 

* The term left list and the notation [T are both intended to suggest left justification. 

t These statements hold only for 1-origin indexing. In 0-origin indexing, a = 
(°,0,1,...,<5(T) -1). 
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A two-column matrix which forms the right list of some tree is said to 
be a well formed right list. Since the ordering of the nodes in a right list of 
a given tree is unique, the right list of a given tree is unique. Conversely, 
any well formed right list specifies a unique tree according to the algorithm 
of Program 1.20. 

Identical remarks apply to the left list, except that Program 1.20 is 
replaced by Program 1.21. Moreover, the necessary and sufficient 
conditions for the well formation of a left list are identical with those for a 
right list and are derived by virtually identical arguments. The case will 
be stated for the right list only. 

If R is a well formed right list representing a tree T, then the dispersion 
(i.e., the number of roots) v^T) = v(R 1 ) — ( +/R h ) must be strictly 
positive. Moreover, if S = a j l/R is any suffix of R, then £ is a right list of 
the tree obtained by deleting from T the first j nodes of the original list. 
For, such level-by-level deletion always leaves a legitimate tree with the 
degrees of the remaining nodes unchanged. Consequently, the number of 
roots determined by every suffix of R x must also be strictly positive. In 
other words, all components of the suffix dispersion vector s defined by 

s, = via^/R,) - (+/6P- 1 /*!), j e i\v{R x )) 

must be strictly positive. The condition is also sufficient. 

Sufficiency is easily established by induction on the column dimension 
of R. The condition is clearly sufficient for v(R 1 ) = 1. Assume it sufficient 
for dimension v(R 1 ) — 1. If s, the suffix dispersion vector of R, is strictly 
positive, then oP/s, the suffix dispersion vector of a 1 f/R, is also positive, 
and by hypothesis a 1 //# represents a tree G possessing s 2 roots. Moreover, 

0 < s ± = s 2 + (1 - Rf) 

implies that s 2 > ^i\ and the number of roots possessed by G therefore 
fulfills the number of branches required by the added node Rf. A 
legitimate tree corresponding to R can therefore be formed by joining the 
last Rf roots of G to the node Rf. 

Tests for well formation can therefore be incorporated in any algo¬ 
rithm defined on a right or left list matrix M by computing the components 
of the suffix dispersion vector s. The recursion s = s t -f 1 — M\~ l is 
convenient in a backward scan of M, and the equivalent recursion 
s t = — 1 -f Mj -1 serves for a forward scan. The starting condition 

for a forward scan is s 1 = v(M 1 ) — (+/M 1 ), and for a backward scan is 
s v = 1 — Mf. Since the criteria of well formation are identical for right 
and left lists, a matrix may be characterized simply as well or ill formed. 
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The purpose served by the degree vector d in the description of a tree is 
sometimes served instead [e.g., Burks et al. (1954)] by the vector g = 
€ — d. It is somewhat more convenient in the analysis of well formation, 
since the expression for the suffix dispersion vector then simplifies to 

S i+1 = (+Mg)> or 5 = (I + □)■//£• 

The index matrix as a function of the degree vector 

The complete determination of the tree corresponding to a given list 
matrix M is best described as the determination of the associated index 
matrix I. For both left and right lists this can be achieved by a single 
forward scan of the rows of M and of I. 

For a right list R it is first necessary to determine r, the number of roots. 
The first r components of R are then the roots of the tree in order, the 
next R i 1 components of R are the second-level nodes reachable from the 
first root, and so forth. Programs 1.20 and 1.21 describe the processes for a 
right list and a left list, respectively. 

Program 1.20. In each execution of the main loop 13-16, the /th row of the 
right list R is examined to determine the index vector of each node on the succeed¬ 
ing level which is directly reachable from it. The number of such nodes is con¬ 
trolled by the parameter d , initialized to the degree of the /th node by step 12. 
The (right list) index of the nodes reachable from node / is determined by /, which 
is incremented on step 14 as the index vector of each node is determined. The 
index vectors of the successive nodes reachable from node / have the final com¬ 
ponents 1, 2, 3, ... , and each must be prefixed by the index vector of node i. 
This assignment is effected by the vector v , which is initialized by the index vector 
of node i rotated left by one (step 11), and which is incremented by step 15 before 
each assignment occurring on step 16. At the outset, v is set to zero and d is set 
to the number of roots as determined by step 4. 

Since j is, at step 10, equal to the current number of roots r augmented by the 
cumulative degrees of the first / — 1 nodes, then r = / — / + 1 and the exit on 
step 10 therefore occurs always and only in the event of ill formation. Alterna¬ 
tively, the test can be viewed as an assurance that each row of the matrix I is 
specified before it is itself used in specification. 

When step 5 is first reached, the index matrix I is complete but is expressed in 
1-origin indexing with zeros representing the null elements. Steps 5-7 translate 
the matrix to the origin </> and mask in the necessary null elements. 

Program 1.21. The index vectors P are determined in order under control of 
the parameter /. The loop 5-18 traces a continuous path through the tree, deter¬ 
mining the index of each successive node of the path by rotating the index of the 
preceding node (step 17) and adding one to the last component (step 13), and 
maintaining in the connection vector c a record c i+l of the index j of the successor 
of node / in the path traced. The path is interrupted by the occurrence of a leaf 
(that is, = 0 on step 18), and the degree vector is then scanned by the loop 
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1-origin indexing 

R 

Right list of T. 

I 

Right index matrix of 

T. 

i 

Index of row of R cur¬ 
rently examined. 

j 

Right list index of node 
reachable from node i. 

V 

Current index vector. 

<t> 

Origin with respect to 
which I is finally ex¬ 
pressed. 



Program 1.20 Determination of the index matrix I associated 
with a right list matrix R 

(19-20) to determine the index i of the last preceding node whose branches remain 
incompleted. Steps 22-23 then respecify v as the index vector of the node follow¬ 
ing node i in the path last traced, and step 21 decrements the component L-f of 
the degree vector. The branch from step 19 to step 22 occurs at the completion 
of each rooted subtree. The test for well formation (step 12) is the same as that 
applied to the right list in Program 1.20, except that the notation for the relevant 
parameters differs. The concluding operations (6-9) include left justification on 
step 7. 

Tree, path, and level compression 

The tree compression 

P U/T 

specifies a tree P obtained from T by suppressing those nodes corre¬ 
sponding to zeros of the logical tree U, and reconnecting so that for every 
pair of nodes x, y of P, x belongs to the subtree of P rooted in y if and only 
if x belongs to the subtree of T rooted in y. If, for example, T is the tree of 
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1-origin indexing 

L 

Left list of T. 

I 

Left index matrix of T. 

j 

Index of row of I being 
determined. 

i 

Left list index of path 
node preceding node j 
in current path (Step 
16), or index of last 
previous node whose 
branches remain un¬ 
exhausted (step 22). 

C i+l 

Index of node follow¬ 
ing node i in last path 
traced from /. 

r 

Parameter for testing 
well formation. 

V 

Current index vector. 

<t> 

Origin with respect to 
which I is expressed. 


Legend 


Program 1.21 Determination of the index matrix I associated 
with a left list matrix L 


Fig. 1.16 with n as the alphabet, and U is the tree of Fig. 1.22 a, then P 
is the tree of Fig. 1.22 b. The new indices are shown to the left of each node 
of P. The set of nodes 221, 222,..., 226, are all on the same level of P 
although they have been shown with branches of different lengths to 
permit easy identification with the nodes of the original tree T. 
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U U/T 

(a) (b) 

Figure 1.22 Compression of tree T of Fig. 1.16 (with n — alphabet) 

The compress operation is best executed on the left list because of the 
grouping by subtrees. Program 1.23 gives a suitable algorithm which also 
serves as a formal definition of the compress operation. 

Program 1.23. The vector u is specified as the node vector of the left list of the 
controlling logical tree U and controls the subsequent process. Step 4 determines 
j as the index of the first zero component of u. Steps 6 and 7 then delete the corre¬ 
sponding nodes of u and of the left list of T, but only after step 5 has determined d 
as the change in degree which this deletion will occasion to the root of the smallest 
subtree containing the deleted node. Steps 9-11 perform a backward scan of the 
degree vector to determine j as the index of the root of the subtree, and step 12 
effects the requisite change in its degree. The exit on step 9 occurs only if the node 
deleted is a root of the original tree, in which event no change is produced in the 
degree of any other node. 

Two further compress operations controlled by logical vectors are 
defined as follows. Path compression is denoted by 

P uj T. 
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1 


u ([U) 2 

2 

L - « 2 /([T) 

3 


u : e 

4 


./' — ( + /“/«) + 1 

5 


f 

1 

6 


U e)\u 

7 


L *- e'ljL 

8 


r <- 1 

9 


j +~j ~ 1 

10 


r <- r — Lp + 1 

11 


r : 1 

12 


Lp ■<— Lp + d 



1-origin indexing 

u 

Left node vector of U. 

L 

Left list of T. 

j 

Index of first zero of u 
(Steps 4-8). Index of root 
of smallest subtree contain¬ 
ing deleted node (Step 12). 

d 

Change of degree caused by 
deletion of node j. 

r 

Number of roots indicated 
by infix (a 3 ' A a*)//L, where 
j is initial value and k + 1 
is current value of j. 


Legend 


Program 1.23 Determination of the left list L = a 2 /[(U/T) 


P is obtained from T by suppressing every node on level j if = 0, and 
reconnecting as in tree compression. Level compression is denoted by 

P-W/T, 

and P is obtained from T by deleting each rooted subtree T* for which 
u t = 0. 

Path compression by a unit vector e j produces a tree of height one. 
Such a tree is, in effect, a vector and will be treated as one. 

Two related special logical trees are defined: the path tree U E such that 
u/ u E = 0 and u\ u E is the full tree E whose nodes are all unity, and the 
level tree U E such that w// u E = 0, and u\\ u E = E. 

Extension of other operations to trees 

Two trees are compatible if they have the same structure. Elementary 
binary operations are extended node by node to compatible trees. For 
example, 

Z X x Y 

implies that node i of Z is the product of node i of X and node i of Y for all 
/. Similarly, 


M b h T 
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specifies M as a tree (of the same structure as T) such that node i of M is 
the y-origin 5-index of node i of T. 

The mapping operation is extended to trees so as to permute the rooted 
subtrees of a tree. Formally 

implies that ^(P) = v(m), that P* is a single null character if m i i 3 (p. 1 (T)), 
and otherwise P* = T m , where y-origin indexing is used for T. 

Permutation of the subtrees rooted in node i of T can be effected as 
follows: 

mj (oP/T,.). 

The notation O//T will denote the application of the binary operator or 
relation o to the nodes of T in right list order (i.e., down successive levels) 
and o/T will denote the same application in left list order (i.e., across 
paths). If the operator is symmetric (i.e., its operands commute), then 
O//T = O/T. 

Maximization (U|T) and minimization (U[T) are extended to trees in 
the obvious way. 

The operations a/w, co/u 9 oja, and rja are each extended in two ways: 
across paths and down levels. Examples of each appear in Fig. 1.24. 
Operations extending down levels are denoted by double virgules and 
represent an application of the corresponding vector operation to each 
level of the tree considered as a vector. For example, the statement 

V a//A 

implies that each level of V is the forward set selection of the corresponding 
level of A, that is, e j /\ = o/e j /A. Operations extending across paths are 
denoted by single virgules and are defined in terms of subtrees. Thus 

y «-<x/u 

implies that V is obtained from the logical tree U by setting to zero all 
nodes of any subtree rooted in a zero node, and 

V co/U 

implies that V is obtained from U by setting to zero every node whose 
subtree contains a zero node. The definitions of <r/U and r/U are analo¬ 
gous. 

Homogeneous trees 

If, for ally, every node on level j of a tree T is either of degree zero or of 
degree v m (T), then the tree T is said to be uniform. If all leaves of a 
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uniform tree T lie in the same level (necessarily the top), then the tree is 
said to be homogeneous. The structure of a homogeneous tree is completely 
characterized by its dispersion vector v(T). All maximal paths in a homo¬ 
geneous tree are clearly of the same length, namely v(T) = v(v(T)). Figure 
1.25 shows a homogeneous tree and its associated dispersion vector. 



co/U 



<r/U 



r/U 



Figure 1.24 Set selection and maximum prefix and suffix operations 
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A tree T for which v(T) = me is called an m-way tree , and a tree for 
which v^T) = 1 and a 1 /v(T) = me is called a singular m-way tree. 

The yth component of the moment vector of 
a homogeneous tree is clearly equal to the prod¬ 
uct of the first j components of the dispersion 
vector, that is, p.(T) = (□ + /)* v(T). The dis¬ 
persion vector is, in turn, uniquely determined 
by the moment vector. The total number of 
nodes is given by jlc( T) = +/p.(T), and it can also 
be shown that //(T) = y J_ y, where y is the 
dispersion vector in reverse order. 

Tree compression of a homogeneous tree H 
(that is, U/H) does not generally produce a 
homogeneous tree, and, in fact, any tree P of 
arbitrary structure can be represented by a pair 
of homogeneous trees U and H such that P = 
U/H. On the other hand, both path and level 
compression of homogeneous trees produce 
homogeneous trees. Moreover, if P = u/ H, then 
v(P) = t//v(H), and if P = «// H, then v(P) = 
v(H) - (+/«)a 1 . 

Since the structure of a homogeneous tree is 
completely specified by its dispersion vector k, 
the structure of the special logical trees can be 
specified in the forms E(k ), u E{k ), and u E{k). 

In a homogeneous tree, the right list or left 
list index of a node can be determined as an 
explicit function of its index vector. Conversely, 
the index vector i can be determined directly 
from the corresponding left list index, to be 
denoted by /(/), or from the right list index r(t). 
In developing the relations between indices it 
will be convenient to use 0-origin indexing 
throughout. 

The right list index is given by 
r(i) =/(i) + g(i), 
where /(«') = +/a'’ ( ‘ >_1 /^(T) 
is the number of nodes in the first v(i) 1 levels, and 

g(i) = (<x” (,) /v(T)) X i 

is the rank of node i in the v(i) th level. For example, if / = (1, 0, 1) in the 



i/(H) = (2,3,2) 
M(H) = <2,6, 12) 

Figure 1.25 Homogen¬ 
eous tree H and disper¬ 
sion and moment vectors 
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tree of Fig. 1.25, then jjl(H) = (2, 6, 12), /(/) = +/(2, 6) = 8, and 
g(i) = (2,3, 2) _L (1,0,1) = 7. 

Since/(/) depends only on v(i), the index i may be determined from r by 
first determining v(i) as the largest value for which /(*) < r, and then 
determining i such that 

(a v<i> /v(T)) If = r —/(*)• 

In tracing a path through a tree, the k\h node of the set reachable from 
node i is the node j = * © (k). It is therefore useful to express r(j) as a 
function of r(i). Clearly 

/O') =/(*') + (KT))„ ( o-i, 

S0‘) = #(0 X (v(T)),.«) +jv- 1 . 

In the special case of a singular homogeneous m-way tree, 

/(f) — 1 + m -}- m 2 + * • • + m v{t) ~ 2 = (me) J_ e(r(i) — 1) 

wi)—l ^ 

_ m — 1 

m — 1 

Hence /(j) = 1 + m x /(/), and ^(j) = m x g(f) +7 V _ 1 . Recursion can 
therefore be performed simply upon the single function r(f) as follows: 

r(j) = m X r(i') + 1 + /„-i. 

The left list index /(i) is most conveniently expressed as a function of 
v{i) and of the vector z(i) (zero extension of /), where z = a v{i) (v(T))\i. 
Clearly v(z) = r(T) and z is the index of the “earliest” leaf reachable from 
node i. In Fig. 1.25, for example, z((l, 2)) = (1, 2, 0). 

The zero extension has the obvious property that every node above the 
path T z(t) precedes node i in the left list, and every node below the path 
follows it. The number of nodes in the path which precede node i is 
v(i) - L 

The number of leaves above the path T z{i) is v(T) _]_ z(i), and more 
generally, the number of (j — l)th level nodes above it is given by 
(a ? /v(T)) J_ ( a V z (0)- Consequently, 

HT) 

l(i) = v(i) -1 + 2 («7v(T)) _L (a j lz(i)). 

3 = 1 

For example, if i = (1, 0) in Fig. 1.25, then z(i) = (1,0, 0) and 
l(i) = v(i) — 1 + (2) _L (1) + (2, 3) JL (1, 0) + (2, 3,2) _i_ (1,0,0) = 11. 
The foregoing result may be written alternatively as 
/(/) = v (i) - 1 + w x *(0, 

where w v = 1, and = 1 + x v t (T)). In the foregoing example, 
w = (10, 3, 1), and w + z(i) = 10. This form is most convenient for 
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determining i as a function of /, for since w + z = l + 1— v(i ), then 
z 0 (i ) = [/ -r wj, z 1 (i) = l((t4> 0 | /) — 1) -T- wj, etc. for all positive values 
of the quotient, and- all components thereafter are zero. The dimension 
v(i) is then determined from the relation v(i ) = / + 1 — w J z(i). 
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EXERCISES 

Organize each of the programs according to the method of leading decisions. 
Except where otherwise indicated, use 1-origin indexing. The conventions of 
Sec. S.l of the Summary of Notation will be used in the statement of each 
of the exercises. 

1.1 Let d = (a, 2, 3,4, 5,6,7, 8,9,10,j,q,k),5 =(c,d,h,s),u = (1, 0, 1,0,1), 
v = (0, 1, 1, 1, 0), x = (16, 8, 4, 2, 1), and y = (2, 3, 4, 5, 6). Determine 

(a) the dimensions v(d), v(s), and v(x). 

(b) the vectors x + y, x — y, x x y, x 4- y, and u + v. 

(c) the logical vectors u A v, u y v, (u ^ v), and (« = v). 

(d) the reductions +/*, x/y, A/u, and vM 

(e) the base two values of u and of r, that is, +/(* x u), and +/(x x v). 

(f) the rotated vectors 2 J, d, 4 f s, and \y. 

(g) the unit vector € x (5) in a 1-origin system, and c 3 (5) in a 0-origin system. 

(h) the infixes (a 5 (7) A <*> 5 (7)) and 2 [ a 3 (7). 
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1.2 Show that 

(a) x/i \ri) — n\ (Include the case n = 0.) 

(b) +li j (n) = n(n 4- 2 j - \) + 2. 

(c) xl(k] x) = x /x. 

(d) (kjx) +(k J [y) =kU* +y)- 

1.3 Write detailed (i.e., component-by-component) programs for the following 
operations. Include tests for compatibility of the operands. 


(a) 

tv 

<-u A v. 

(g) 

u 

-<- a-Hk). 

(b) 

W 

+- u V V. 

(h) 

u 

*- i i “W- 

(c) 

b 

u/a. 

(i) 

c 

— \«, «, b\. 

(d) 

B 

<- ii/A. 

(j) 

c 

*-1<*, w, bl. 

(e) 

B 

-"MIA. 

(k) 

c 

u\a. 

(f) 

X 

- o > o)/*. 





1.4 Establish the identities 

(a) la, u, bj = \u/a , u, u/b\. 

(b) \a, u, b\ = lu\a, u, u\b /. 

1.5 The classic “rings-o-seven” puzzle can be posed as follows: an ordered 
collection of n rings is to be placed on (removed from) a bar under the following 
constraints: 

(i) ring n may be placed on or removed at will. 

(ii) ring k may be placed on or removed only if ring (k + 1) is on and all 
succeeding rings are off. 

The state of the rings can be described by a logical vector u, with u k = 1 if ring k 
is on. Write programs on u which describe the removal of the rings beginning 
with 

(a) u = € [The successive values of u represent a reflected Gray code ; see 
Phister (1958).] 

(b) u arbitrary. 

1.6 The ordered array of variables used to represent a variable x in some coding 
system may be considered as a vector representation of x , denoted by p(#). In the 
8421 code for decimal digits, for example, p(0) = (0, 0, 0, 0), p(l) = (0, 0, 0, 1), 
and, in general, p(a) is defined by the relation + /[tv x p(#)] = x , where tv = (8,4, 
2, 1). For each of the following coding systems, (see Richards, pp. 183-184 for 
definitions), write a concise expression for p(#): 

(a) the excess-three code for decimal digits. 

(b) any chosen two-out-of-five code. 

(c) any chosen biquinary code. 

(d) the semaphore code for the alphabet (see any good dictionary). Denote 
each code by a two-component vector p(a?) c: i°(8). Use a i x, where 
a = (a, b, c, .. ., z). 

1.7 Let X be a square sparse matrix represented by the logical matrix U = 
(X ^ 0) and either or both of the vectors r = U/X, and c = U//X. Write pro¬ 
grams to determine the product Y = X + X, using the arguments 

(a) f, c, and U. 

(b) r and U. 

(c) c and U. 
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1.8 Prove that 

(a) \x) = 

(b) [[a -r b\ -r cj = [a -r /?cj for all positive integers a, 6, and c. 

1.9 Let r = El A, and c = El IA be the row list and column list, respectively, of 

the matrix A, and let r h , A/, and be corresponding elements of the three 

representations of A. Determine: 

(a) h as a function of k , v(yi), and 

(b) A; as a function of /z, v(>4), and y(A). 

(c) the permutation vector h such that c = h J r. 

1.10 Show that 

(a) A /u = v/u (Use De Morgan’s law for two variables and induction.) 

(b) ^/u = 2 | 0 +/« (Use induction.) 

(c) =/u = 2 j 0 +/u. 

(d) #/u = =/u. 

(e) U * r = (2c) loj l/ + r). 

(f) U * V = U ^ V. 

(g) (t ° u) A (r ° tv) = (t o W ) A ( v o U ). 

1.11 (a) Show that + /x = +/(«/*) + +l(u/x). ([nclude the case u = 0.) 
(b) What properties are required of an operator O that it satisfy the relation 
established for + in part (a)? 

1.12 Show that 

(a ) X+Y = (UIX) + (ul/Y) + (u/X) J (u//F). 

(b) u/(X +Y) = X + (u/y). 

(c) ull(X +Y)= (u/IX) + y. 

(d) (u A v)la = (tt/r)/(tt/a). 

1.13 Use the result of Exercise 1.11(b) to extend the results of Exercise 1.12(a-c) 
to logical operators. 

1.14 Write programs to determine: 

(a) the value of the polynomial * at the point a, that is, to evaluate Q/e) * 
for y = a. Use no more than v(x) multiplications. 

(b) the derivative of the polynomial x, that is, the vector z such that 

d 

(ye) _L z = — ((ye) J_ *), and v(z) = v(x). 
ay 

(c) the integral z of the polynomial x satisfying the boundary condition 

(<Z€) _|_ z = b. 

(d) the quotient q and remainder r obtained in dividing the polynomial n by 
the polynomial d, for v(d) < v(n). 

(e) the value of the polynomial n at the point a by using part (d) with d = 
(1, -a). 

d 

(f) the value of — (0/e) J_ ri) at the point a by two applications of part (e). 

(g) an approximate real root of the equation (ye) _|_ x = 0, using parts (e) and 

(f) and the Newton-Raphson formula [Kunz (1957)]. 
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1.15 Let the components of the vector r be the real roots of a polynomial x. 
Write a program to 

(a) determine the symmetric functions of r. [Dickson (1939), Ch. X.] 

(b) determine x as a function of r. 

1.16 Write a program to determine the polynomial x consisting of the first n 
terms of the exponential series 1 + y + y 2 l 2 ! + •••. 

1.17 Write a program to determine the moduli of all roots of the polynomial x , 
using the Graeffe method [Kunz (1957)]. Assume that operations for the logar¬ 
ithm and exponential functions are available as subroutines. 

1.18 List all the 1-origin permutation vectors of dimension four which are self¬ 
inverse. 

1.19 Using 1-origin indexing, write programs to derive 

(a) the permutation k which is inverse to the permutation j. 

(b) a permutation./ which transforms a given logical vector u to a prefix vector. 

1.20 A square logical matrix U such that +jU = +//17 = € is sometimes called 
a permutation matrix , since premultiplication of a numerical vector x determines 
a permutation of x. Write programs to determine 

(a) the permutation matrix U corresponding to the 1-origin permutation vector 
fe, that is, determine U such that U + x = k x. 

(b) the permutation k corresponding to a given permutation matrix U. 

(c) the permutation V which is inverse to the permutation U. 

1.21 Let p be the vector representation of a permutation and let c be the 
standard representation in terms of disjoint cycles, including all cycles of one 
[Jacobson (1951), p. 34.] Each cycle of c is enclosed in square brackets, each 
half-bracket being considered as a component of c. For example, if c = ([, 1, 3, 
5, ], [, 2, 4, ], [, 6, ]), then p = (3, 4, 5, 2, 1, 6), v(c) = 12, and v(p) = 6, and, in 
general, v(c) = v(p) -f 2 k where k is the number of disjoint cycles in p. The 
usual elision of cycles of one would give c =([,1,3, 5, ], [, 2, 4, ]), but this deter¬ 
mines a unique correspondent p only if the dimension of p is otherwise specified, 
and inclusion of all cycles of one will therefore be assumed. If each infix of 
numerical components in c is preceded by a left bracket and followed by a right 
bracket, and if c determines a legitimate permutation vector p, then c is said to be 
well formed. 

(a) Write a program to determine p as a function of a well formed permutation 
c. Include determination of the dimension of p. 

(b) Modify the program of part (a) to incorporate checks on the well formation 
of c. If c is ill formed, the vector p is to be defined as the literal “ill 
formed.” 

(c) Modify part (b) to process a sequence of vectors c 1 , c 2 , . . ., each pair being 
separated by a single null element, and the end of the sequence being 
indicated by a pair of successive null elements, i.e., to process z = 
c 1 © (°) © c 2 © * • * © c r © (o, o). Include checks on the well formation 
of each permutation. 

(d) Write a program to determine the parity [Jacobson (1951), p. 36] of a 
permutation vector p. 
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1.22 Write detailed programs for the following processes: 


(a) 

k 

-djx 

(>) 

m -<- b f 0 a 

(b) 

y 

m ^ x 

(j) 

M +-B? 0 a 

(c) 

V 

U \ X 

(k) 

u *- 


(d) 

V 

o 

+-u\X 

(1) 

c <- 

b n a 

(e) 

V 

<- a/« 

(m) 

c «- 

b A a 

(f) 

V 

-co//l7 

(n) 

c «- 

b kj a 

(g) 

V 

a/b 

(o) 

C — 

a ® b 

(h) 

V 

t/ /B 





1.23 (a) Copy onto file <3> x 2 successive groups of items from the row of files O 1 

in cyclic order, omitting any exhausted files. The end of each group is 
demarked by a partition X 2 , and the end of each file by a partition X 3 . 
(b) A file which is always recorded in the forward direction and read in the 
backward direction functions as a stack. Using file <I> 2 2 as a stack, 
modify the program of part (a) so as to reverse (in the output file Oj 2 ) 
the order of the items within each group. 

1.24 The accompanying node vector n and connection matrix C together specify 
a directed graph (C/ = 1 indicates a branch from node / to node j) which is, in 
fact, a tree. 

n = (a, b , c, d, e, f,g) 

1 0 0 0 0 1 1 0 

0 0 0 0 0 0 0 

1 0 0 1 0 0 0 

C = 0000000 

0 0 0 0 0 0 0 

0 0 0 0 0 0 0 

0 1 0 0 0 0 0 

(a) Draw one of the possible ordered trees represented by n and C. 

(b) For the tree T of part (a) show the full left list [T. 

(c) Show the full right list ]T. 

1.25 Write programs which include tests on compatibility and which determine 

(a) L = [T from R = ]T 

(b) 5 = ](u/T) from *x( T), ]T, and u 

(c) M = [(u//T) from L = [T and u 

(d) M = [(k T) from L = [T and k 

1.26 (a) Give an example of a well formed right list which demonstrates that a 

prefix of a right list is not, in general, well formed. 

(b) Show clearly where the argument used in establishing the well forma¬ 
tion of any suffix of a well formed list breaks down when applied to a 
prefix. 
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1.27 Give formal proofs for the facts that 

(a) a left list groups nodes by subtrees. 

(b) a right list groups nodes by levels. 

(c) ^(T) = v{d) -h/d, where d is the degree vector of T. 

1.28 Write programs to determine fi(T) as a function of 

(a) the left list degree vector of T. 

(b) the right list degree vector of T. 

1.29 Trace Programs 1.20 and 1.21 for the tree of Exercise 1.24. 

1.30 Show that for a homogeneous tree H, /<(H) = y _|_ y, where y = v(H). 

1.31 If H is homogeneous, v(H) = (3, 2, 3, 4), and i =(1,0, 2), determine, in a 
0-origin system 

(a) the left list index l(i). 

(b) the right list index r(i). 

(c) the index j of the node whose left list index is 27. 

1.32 (a) If K = l°(u) [ (e(n) £ i°(aO), show that K + K = n(E — /). 

(b) If y is any permutation of x and v(x) = «, show that x + K + x = 
y + K+y. 

1.33 Using the Euclidean algorithm, write programs to determine: 

(a) d as the greatest common divisor of positive integers x and y . 

(b) d as the g.c.d. of x and y where d, x, and y represent polynomials in z 
(e.g., (ze) _L x). 

1.34 To assure uniqueness, the number of different digits (symbols) used in a 
base b number system must not exceed b. The limitation to the particular range 
0 < cii < b is, however, not essential. For example, a base three system can be 
constructed using digits —1,0, and 1, for which it is convenient to adopt the 
symbols —, 0, and + , respectively. The positive numbers beginning at zero are 

then represented by the sequence 0, + , H—, +0, + +, H-, + —0, H-h, 

+0—, +00, etc. The negative numbers beginning at zero are 0, —, —h, —0, 
-, —h +, —h0, —I—, —0 + , —00, etc. 

(a) Construct addition and multiplication tables for this number system and 

calculate the sum and the product of the numbers 0— and-. Use the 

decimal system to check all results. 

(b) Negative numbers are represented in this system without the attachment 
of a special sign position. Special rules regarding sign are therefore 
banished except that it is necessary to formulate a rule for changing the 
sign of a number, i.e., to multiply by minus one. Formulate such a 
rule. 

1.35 For any integer n, let x 2 = 2 | 0 n, x s = 3 | 0 n, x 5 = 5 | 0 n, and = 7 | 0 n. 
As shown by Garner (1959), the ordered array (x 2 , x z , x 5 , x 7 ) provides a representa¬ 
tion of the integer n in a so-called residue number system. 

(a) Write the residue representations of the first ten nonnegative integers. 

(b) For integers n in the range 0 < w < (2 x 3 x 5 x 7) show: 

(1) that the representation is unique. 

(2) that an addition algorithm may be defined which treats the several 
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columns independently, i.e., there are no carries. (The sums must 
also lie within the specified range.) 

(c) Discuss the choice of moduli for extending the range of the representa¬ 
tion. 

(d) Show that the algorithm derived in part (b) is valid for all positive and 
negative integers in the range —a/2 < n < a/2 for a = 2 x 3 x 5 x 7. 

(e) Derive an algorithm for obtaining —n from n. 

(f) Derive an algorithm for multiplication. 

(g) The sign of the number (i.e., its relation to zero) is not displayed 
directly by this representation. Convince yourself that its determi¬ 
nation is nontrivial. 

1.36 Let x, y , and z be the positional representations of the numbers z, y, and 2 
respectively. Using the floor and residue operations, write programs to deter¬ 
mine z as a function of x andjy, where z = x + y and the representation in use is 

(a) base b. 

(b) mixed base b. 

(c) the 4-, —, 0 base three system (of Exercise 1.34). 

(d) the residue number system (of Exercise 1.35). 

1.37 Write programs for the multiplication z = x x y for each of the cases of 
Exercise 1.36. 

1.38 Write programs to convert in each direction between the following pairs of 
number systems: 

(a) base b x and base b 2 . 

(b) base b 1 and base b 2 . 

(c) base three and the +, —, 0 base three of Exercise 1.34. 

(d) residue and base b (Exercise 1.35). 

1.39 (a) Show that the superdiagonal matrices satisfy j l + k l = ^ + ^/. 

(b) A matrix of the form J = (xl 4 - 1 I) is called a Jordan box. Write the 
expansion of the Azth power of J. 

(c) Show that X + Y = X 1 ° Y 1 + X 2 ° Y 2 + • • • + X v(x) ° Y« x \ 

(d) Determine an explicit solution to the set of linear equations A + x =y, 
where u\x = a and v/y = b are known and where 4 -\u 4 - 4 -/r = 
v(A) = y(A). State the conditions for the existence of a unique solu¬ 
tion. 

1.40 Any nonsingular matrix A can be reduced to the identity / by a sequence of 
row operations of the form A i xA i 4- yA\ or A 1 <—> A j . The process which 
accomplishes this (using row operations only) by reducing successive column 
vectors to the successive unit vectors is called Jordan or complete elimination. 
If the same sequence of row operations is executed upon the identity matrix, it 
will be transformed to the matrix B such that B + A = /. The inverse of A can 
therefore be obtained by performing Jordan elimination on the matrix M = 
A © I so as to reduce the first v{A) columns to the identity. The last v(A) 
columns are then the inverse of A. 

(a) Write a program to determine the inverse of A by Jordan elimination. 
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(b) The sequence of operations which reduce the /th column of A to e* is called 
the /th step of the process, and the/th diagonal element at the beginning of 
the /th step is called the /th pivot element. Modify the program of part (a) 
so that each step is preceded by a column permutation which yields the 
largest (in absolute value) pivot element possible. This modification tends 
to reduce the accumulation of round-off errors. 

(c) [n the Jordan elimination of part (a), it is unnecessary to store the identity 
matrix explicitly, and, since the /th column is first affected at the /th step, 
only one new column need be brought in at each step. Moreover, the /th 
column of A may be discarded after its reduction to e i on the /th step, and 
it is therefore necessary to store only a square matrix at all times. Show 
that by shifting all columns to the left and by moving the rows upward 
cyclically, a very uniform process results, with the pivot element in the 
leading position at every step [Iverson (1954) or Rutishauser (1959)]. 
Write a program for the process. 

(d) Modify part (c) to allow the choice of pivot elements as in part (b). The 
effects of the permutation on the not explicitly recorded identity cannot be 
indicated directly, but the performance of the same set of permutations in 
reverse order upon the rows of the resulting inverse produces the same 
result. Verify this and program the process. 

1.41 (a) Show that a group [Jacobson (1951)] can be represented by a square 

matrix M such that each row and each column is a permutation vector. 

(b) Show that M i = = i 1 for some /. 

(c) What are the necessary and sufficient conditions that the group repre¬ 
sented by M be Abelian ? 

(d) Write a program to determine all cyclic subgroups of a group repre¬ 
sented by M. 

1.42 If U is a logical matrix whose rows are each nonzero, mutually disjoint, and 
collectively exhaustive (that is, (+/U > €) = €, and +//17 = e), then U defines 
an m-way partition of n , where m = /u(U), and n = v(U). The partition is more 
commonly represented by the vector p — +/U [Riordan (1958), p. 107]. Clearly 
+lp = n. Write a program to generate 

(a) all partitions U of a given integer n. 

(b) all distinct partitions of n , where U and V are considered equivalent if 
p = +/U is a permutation of q = +/V. 

1.43 Let x be a space vector (i.e., of dimension three), and let R(x) be the square 
matrix i f (c ° x). Show that 

(a) +IR(x x y) = (x + y) x e 

(b) e + (x xy) = x+y 

(c) (+IR(x x ;y)) + (w x z) = (* + y) x (w + z) 

(d) (* i y) X (* + y) = (* X y) + (* x y) + 2(|* x |>») + (J* x jy). 

1.44 Let x ■ y = (|* x |.y) — (J,* x fy) be the vector product of * and y for 
vectors of dimension three. Show that 

(a) this agrees with the usual definition [Margenau and Murphy (1943)]. 

(b) x y = -(yx) 
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(c) x • y is perpendicular to x , that is, x + (x 
[x = 2\x for a vector of dimension three.) 


y) = 0. (Use the fact that 


1.45 Let [a] = \/(x + x) be the length of x 9 let x y y = 


[x] x [y] 


be the cosine of 


the angle between x andjy, and let ^0*3^ = V l — (* y y ) 2 be the sine of the angle. 
Use the results of Exercises 1.43 and 1.44 to show that for space vectors 

(a) [x - y] = [x] x [y] x (x ay). Note that [x • y] is the area enclosed by the 
parallelogram defined by x and y. 

(b) (* y) ■ z = (* + z) x y -(y + z) x x 

(c) (*-;y) + z = x + (y ■ 2 ). 
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2 

MICROPROGRAMMING 


The algorithms to be executed by an automatic computer must be 
described in the restricted set of operations (called instructions or com¬ 
mands) provided in a given computer, and an algorithm so described is 
called a computer program . Since computer instructions are relatively 
complex, they may be described in turn by microprograms employing 
more elementary operations. 

Microprograms may be used to define a computer instruction set for the 
programmer, to define the detailed algorithms by which the computer 
circuits produce the operations of the instruction set, or for a variety of 
other purposes. In the design and development of a computer, for example, 
it is important to maintain precise and complete communication between 
the computer programmer, the computer (or system) designer, and the 
logical circuit (or hardware) designer. The system designer will, in fact, 
ordinarily begin with a description at the programmer’s level and proceed 
through increasing detail to the hardware designer’s level. Meanwhile, the 
programmers concerned with evaluating potential performance and with 
developing systems of metaprograms (so-called automatic programming 
systems) should be enabled to follow and to influence the evolving de¬ 
sign. 

The use of microprogramming will be illustrated by a description of the 
IBM 7090 computer (to be called the 7090) at a level approximately suited 
to the programmer and the system designer. The final section treats 
some problems in the extension to the hardware design level. 

The programs together with the lists of operands constitute a self- 
contained description of the 7090 which, to readers already familiar with 
computer organization and with the relevant sections (1.1 to 1.11 and 1.14) 
of Chapter 1, should prove readable without reference to the text. The 
text serves only to elucidate the microprograms and does not treat all of 
the instructions described by them. Tables 2.1, 2.13, and 2.14 summarize 
the dimensions, format, and significance of the various operands and 
should be consulted as each is first encountered. 0-origin indexing will be 
used throughout. 
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§2.1 


2.1 INSTRUCTION PREPARATION 

The operation of an automatic digital computer splits naturally into two 
phases which normally alternate: the instruction fetch and preparation and 
the instruction execution. The former involves the selection from some 
information store (memory) of the next instruction to be executed, its 
transfer to one or more control registers , and perhaps some modification 
of the instruction introduced into the control registers through so-called 
indexing, indirect addressing, or relocation. The execution phase begins 
with the decoding of the operation code segment of the instruction in the 
control registers to select the particular execution microprogram to be 
employed, and continues through the execution of the selected micro¬ 
program upon variables in certain central registers and in certain memory 
registers determined by the address portion or portions of the control 
registers. 

The main memory of the 7090 will be denoted by a logical matrix M of 
dimension 2 15 x 36. Selection from M is limited to the selection of a row 
each such row is called a word, and M 1 ' is called word i or register i. 


Dimension 


Memory 

M 

2 15 x 36 

Index accumulators 

I 


3 x 15 

Sequence vector 

s (instruction counter) 

15 

Command vector 

c 


36 

Upper accumulator 

u 

(s, q,p, 1,2,..., 35) 

38 

Lower accumulator (Quotient register) l (s, 1, 2,.. ., 35) 

36 

Upper accumulator overflow 

u 



Lower accumulator overflow 

l 



Trapping mode indicator 

t 

(0: normal 


Instruction fetch mode 

f ■ 

1: skip channel trap 
[2: skip trap and fetch phase 

fO: no indexing 



Indexing class 

k\c) 

1: normal indexing (15 bit) 
[2: restricted indexing (9 bit) 


Indirect addressing class 

k\c) j 

[0: no indirect addressing 
ll: indirect addressing 


Console start signal (run) 

r 



Binary representation of z 

pO) 



Table 2.1 

Central computer operands 
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Each instruction is a full word selected from M, and the sequence in 
which instructions are selected is controlled by a control register called the 
instruction counter or sequence register. This register represents a logical 
vector s of dimension 15 whose base two value determines the word / to be 
selected in the next instruction phase. The quantity* _\_s is incremented 
after each instruction fetch and therefore selects instruction words in 
natural ascending sequence. The value of s may, however, be respecified 
by the execution of certain branch instructions. 


c <-M^ 

J_*-2 15 |(l + J _s) 


Program 2.2 Basic instruction fetch 

The current instruction will be denoted by c. It is stored in a 36-bit 
command register. 

The basic instruction fetch involves only the variables M, s , and c, and 
is described by Program 2.2. The second step shows that the incrementa¬ 
tion of J _s is reduced modulo 2 15 and that the selection of instructions 
from the 2 15 word memory is therefore cyclic. 

Additive indexing 

It is convenient to the programmer to be able to add (or subtract) an 
index quantity i to (from) the address portion of an instruction in the 
command register c before its execution. This quantity is represented in 
base two by a logical vector a and is stored in a special index register. In 
the 7090, the data address portion of c is the fifteen-bit suffix to 15 /c and the 
indexing is subtractive: 

_Lco 15 /c 2 15 | (J_io 15 /c — J_a). 

The reduction modulo 2 15 again indicates cyclic treatment of addresses. 

The 7090 contains three index registers or index accumulators which may 
be used independently or jointly. They will be denoted by the index 
matri I of dimension 3x15. One or more (or none) of the index regis¬ 
ters V are selected according to the value of the vector i = (18 j, a 3 )/c, the 
three-bit index tag portion of the command, as follows: 

_Lw 15 /c «- 2 15 1 (JL«o“/c - ±(((18 | a 3 )/c) X I)). 

The address in the command register is clearly decremented by the base 

* Since number bases other than two will be used but rarely in the present chapter, 
the elided form _]_ x will be used instead of (2c) _]_ x . 
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two value of the vector obtained by on ng together the selected rows of I. 
The on ng of the index accumulators permits simple circuits for their 
selection. It is, however, of questionable value to the programmer, and 
the 7090 index registers are normally used individually. 

Indirect addressing 

It is often convenient (as in the execution of a permutation) for a 
programmer to specify a data address indirectly, i.e., to specify the address 
of the word containing the desired data address. Such indirect addressing 
could proceed through several levels, but in the 7090 it is limited to one. 


C 12 A C 13 • 0 
co 18 /c <-<x ) 18 /MJ- w15 / c 


Program 2.3 Indirect addressing 

Only the last half of c is respecified by the corresponding portion of the 
selected word, as described by Program 2.3. The occurrence of indirect 
addressing is determined by components c 12 and c 13 of the operation 
code. 

Dynamic relocation 

The correct execution of any computer program requires that each 
instruction and each operand be stored in the register assigned in the 
construction of the program. A program can, however, be relocated by an 
integral amount n if each word originally assigned to address j is assigned 
to address j + n , and if each address in the program is also incremented by 
n. The incrementation of program addresses can be performed explicitly 
by an assembler or other metaprogram, or it can be performed dynamically 
by an additive index register containing the number n. An index register 
employed exclusively for this purpose is called a base address register . 

More generally, the provision of a table of base addresses permits 
independent dynamic relocation of different blocks of a program, where 
each block is confined to a set of successive registers. This is equivalent to 
one-level indirect addressing in which a certain portion of the address 
(e.g., vfci o 15 /c)) selects from memory one of a table of base addresses to 
respecify the same portion t>/(co 15 /c) thus: 

i>/'(fa> 15 / C ) iY(cj 13 /M 1v/( “ 15/c) ). 

If, for example, v = a 7 , and the format is otherwise as in the 7090, then 
columns 21-27 of registers 0 to (2 7 — 1) provide the base addresses for 
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successive blocks of 2 8 registers each. The 7090 provides no dynamic 
relocation. 

Branching, interruption, and trapping 

The normal sequence of instructions (fetched from successive memory 
registers) can be interrupted by respecifying the sequence register s. Such 
respecification is performed in the execution phase of certain instructions, 
primarily those called transfers , and skips. The simplest branch is the 
TRA* (transfer), whose execution effects the following operation 

s co 15 /c. 

The normal sequence can also be broken by the insertion of an instruc¬ 
tion in the command register without disturbing the sequence register. 
Unless the inserted instruction is itself a branch, the normal sequence is 
resumed immediately. 

If just before a branch (or insertion) the present value of s is stored in 
some chosen memory register /, then the data in register i can be used in a 
subsequent branch to reestablish the original sequence at the point reached 
before the first branch. The storage of s and immediate branch are jointly 
called an interruption. An interruption which is performed automatically 
upon the occurrence of certain special conditions is called a trap. A trap 
provides a convenient device for inserting in the normal program sequence 
a subprogram demanded by the occurrence of the special conditions. 

In the 7090, the so-called channel trap is controlled by an 8 x 3 logical 
matrix T whose elements are determined by three different conditions 
existing in each of the 8 input-output channels of the computer. A corre¬ 
sponding enable matrix E (also 8x3) and an enable toggle e determine 
which elements of T are effective. 

The channel trap is effected in the first phase of the instruction fetch 
(Program 2.4) as described by steps 2-8. If the matrix eE a T is zero, the 
branch on step 2 skips the trap operation and begins the normal fetch on 
step 9. If not, step 3 determines j as the index of the first nonzero row, 
step 4 stores s in a memory register determined by y, and step 5 stores the 
nonzero row (which indicates the particular condition causing the inter¬ 
ruption) in another portion of the same register. Step 6 resets the indi¬ 
cators which occasioned the trap. Step 7 resets the enable toggle e and 
hence (as is clear from step 2) prevents the occurrence of further traps 
until e is again set to one by the execution of a special enable instruction 
ENB. (The reset of e to zero prevents the uncontrolled interruption of 
interruptions.) Step 8 performs the actual insertion by transferring to the 

* 7090 instructions will be referred to by the mnemonic codes used in the IBM Manual 
(1960). 
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command register the content of a second memory register determined by 
j. The sequence register is undisturbed. 

The 7090 can also be operated in a special trap mode which effectively 
converts all transfer instructions (but not skip instructions) into inter¬ 
ruptions. Discussion of this type of trapping will be deferred since it is not 
relevant to the instruction fetch phase. 

Complete instruction fetch 

The complete instruction fetch comprises three phases: channel 
trapping, the fetch proper, and the instruction preparation by indirect 
addressing and indexing. They are described by steps 2-8,9-10, and 11-18, 
respectively, of Program 2.4. 

Certain of the three phases may be skipped according to the setting of 
the fetch mode indicator f In the normal case (/ = 0) none are skipped. 
If/ = 1, the trap phase only is skipped. This case occurs after execution 
of an instruction such as the RDS (read select), which must be followed by 
a certain auxiliary instruction within a fixed time limit. If/ = 2 (a case 
which occurs only after execution of the XEC (execute) instruction), the 
trap and the fetch proper are both skipped, and the command already in c 
is merely prepared by indexing and indirect addressing. In every case,/ 
is reset to its normal zero value by step 11. 

Not all instructions are subject to indexing. The indexability of a 
command c is determined by a class function k°(c), which assumes the values 
0, 1, or 2, according to whether c is subject to no indexing, normal indexing 
(affecting all fifteen bits of the address), or restricted indexing (affecting 
the last nine bits of the address), respectively. This behavior is determined 
by the branch on step 13. 

A second class function k\c) determines whether the instruction c is of 
a type subject to indirect addressing. Actual indirect addressing of any 
particular instruction of the appropriate type is initiated by the configura¬ 
tion c 12 = 1 and c 13 = 1. The function k\c) is itself independent of c 12 
and c 13 . 

The class functions k°(c ) and k\c) will themselves be specified by 
prefacing the mnemonic code of each instruction described by a pair of 
digits. Thus, 

11 CLA 

indicates that clear and add is subject to both indexing and indirect 
addressing, and 

10 CHS 

indicates that change sign is subject to indexing but not to indirect ad¬ 
dressing. 
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i = (18 ja 3 )/c 

Program 2.4 Complete instruction fetch 

The phases of instruction preparation are performed in the following 
order: 

indexing (if indicated); indirect addressing (if indicated). 

Moreover, if indirect addressing is performed, the new address is itself 
re-indexed (if indicated). As shown by steps 12, 16, and 18, the indirect 
addressing is limited to a single level. 

2.2 INSTRUCTION EXECUTION 

The execution phase begins with the “decoding” of the operation part 
of the command c to select the appropriate microprogram to be executed. 
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Except for the format of the operation code* (which in the common case 
occupies the prefix a 12 /c) the details of the decoding are, however, of no 
interest to the programmer, and attention will be confined to the execution 
microprograms. These may be grouped into a small number of families; 
for the 7090 they are load and store , branch , logical , arithmetic , shift, 
convert , and input-output. 

Certain of the arguments and results of the computer instructions are 
represented by three central data registers to be denoted by /, u , and d. 
The registers u and l serve as accumulators in the addition and other 
arithmetic operations, and, since u and / jointly represent double precision 
numbers (i.e., carries are in some operations propagated between the high 
order end of / and the low order end of u ), they will be called the upper and 
lower accumulator, respectively. Since / receives the multiplier in a 
multiplication and the quotient in division, it is called (in the 7090 manual) 
the Multiplier-Quotient or MQ register, and the letter Q occurs in the 
mnemonic code for instructions affecting it. 

Signed numeric quantities are represented in base two with the sign in the 
first component, i.e., register i represents the quantity y = (1 — 2 Mf) x 
(_La 1 /M i ). The lower accumulator / is, like each memory register, of 
dimension 36, and the sign of a numeric quantity is represented by / 0 . 
The upper accumulator is of dimension 38 and represents the number 
(1 — 2 u 0 ) x (jLa 1 /^)- The two extra components u x and u 2 are called 
overflow positions and are excluded from normal transfers of data from u 
to the memory. The component u 2 (called the p- bit) is, however, included 
instead of the sign bit u 0 in certain logical instructions. The component 
u x (called the ^-bit) is made accessible only by certain shift operations. 

The register d (distributor) serves only as intermediary in transfers 
between main memory and the central data registers u and / and is not 
accessible to the programmer. 

Load and store 

In each member of the family of basic load and store instructions 
(Program 2.5), the memory word involved is selected by co 15 /c, the address 
portion of the instruction. The instruction STA stores only the address 
part of u, and STD stores the decrement part, so called because it is used 
in certain instructions to specify the amount of decrement to be applied to 
an index register. The STP stores the p- bit and the first two digits of the 
magnitude part of u ; that is, the three-bit prefix of the logical part of u 
which enters into logical operations. The STO instruction stores the 

* The operation code representing instruction a; is a logical vector to be denoted by 
p(a). Thus p(CLA) = (000101000000). 
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Load Q 

11 LDQ-» 

l M -L“ w / C 


Store Q 

11 STQ —* 

l 


Store left half Q 

11 SLQ -> 

a 18 IM- Li * lb l c +- a 18 // 


Store 

11 STO -> 

M J-“ 15 / c <- ( J. a 2 )/u 


Store zero 

11 STZ -> 

M-L" 15 / c «- 0 


Store address 

11 STA -> 

a> 15 /w 


Store decrement 

11 STD -» 

(3 | a 15 )/M-L“ 15 / c (5 | a 18 )/u 


Store prefix 

11 STP —> 

a 3 /M- Lw ‘ 6 / c <- (2 l a 3 )/a 


Store tag 

11 STT —> 

(18 i a 3 )/M-L“ 15 / c (201 a 3 )/u 


Store instruction 

location counter 11 STL -> 

w 15 /M-L“ 15 / c t- s 


Store logical word 11 SLW -» 

M-J-" 16 /c -_gi»/« 



Program 2.5 Load and store instructions 


normal numeric part of u (that is, all but the overflow bits), whereas SLW 
(store logical word) stores the p -bit instead of the sign. 

The instructions which load and store the index accumulators (Program 
2.6) are of four types, as indicated by the leading letter of each of the 
mnemonic codes, L for load index from memory, S for store index in 
memory, A for load index from the address of the command register, and 
P for place the index in the upper accumulator or the upper accumulator 
in the index. The portion of memory, command register, or upper 
accumulator involved in each of the ten instructions which specify the 
index is shown in steps 1-10. The last five of these differ from the corre¬ 
sponding members of the first five only by complementation on 2 15 , as 
shown in step 11. Since the subtraction occurring in indexing (step 14 of 
Program 2.4) is reduced modulo 2 15 , the effect of complementation is to 
add rather than subtract the quantity used to load the index accumu¬ 
lator. 

Step 12 shows that the index accumulators specified are selected by the 
three-bit tag vector i = (18 [ a 3 )/c and that each receives the same specifying 
quantity. Since the tag vector is used to select the index registers to be 
specified, it cannot also be used to specify indexing of the instruction 
itself, and, consequently, none of the load and store index instructions are 
indexable. Neither do they permit indirect addressing. 
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Load index from address 

Load index from 
decrement 

Address to index true 

Place address in index 

Place decrement in index 

Load complement of 
address in index 

Load complement of 
decrement in index 

Address to index 
complemented 

Place address in index 
complemented 

Place decrement in 
index complemented 


Store index in address 


Place index in address 



(i = (18 I a»)/c) 

Program 2.6 Load and store index instructions 
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The last four steps show the storing of the index accumulators. The 
quantity stored is the or function of the accumulators selected by the tag 
(18 | a 3 )/c. 

Branch instructions 

The basic branch instructions are of two main types, the transfer 
. denoted by a leading T in the mnemonic code) and the skip. The behavior 
of the skip instructions is shown in steps 1-10 of Program 2.7, and is 
typified by the PBT (/7-bit test) of steps 1 and 10. If the p -bit of the upper 
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accumulator is not zero, the sequence vector is incremented so as to skip 
the next instruction in the sequence; if the p -bit is zero, the instruction has 
no effect. The various skip instructions differ in the particular tests made, 
and the last two (CAS and LAS) differ also in providing three alternatives, 


/7-bit test 

Low order bit test 

Storage zero test 

Storage nonzero 
test 

Compare accum¬ 
ulator with storage 


Logical compare 
accumulator with 
storage 


Execute 


Store location and 
trap 


Leave trapping 
mode 

Enter trapping 
mode 

Trap transfer 

Console clear or 
reset 



Program 2.7 Skip type and other special branches 


skipping 2, 1, or 0 instructions according to whether the quantities com¬ 
pared stand in the relation <, =, or >, respectively. 

When operating in the nontrapping mode (/ = 0), the essential operation 
of the transfer type of branch (Program 2.8) is the (conditional) respecifi¬ 
cation of the sequence vector s by the address portion of c. The HTR 
(halt and transfer) also suspends operation of the computer until a run 
signal is received from the console (steps 27-29). 

In the trapping mode, all transfer operations are converted to inter¬ 
ruptions; the sequence vector is first stored in register zero, and a 
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Transfer on index high 

Transfer on index low 
or equal 

Transfer on index 

Transfer on no index 

Transfer with index 
incremented 

Transfer and set index 
Transfer 

Halt and transfer 

Transfer on zero 

Transfer on nonzero 

Transfer on plus 

Transfer on minus 

Transfer on overflow 

Transfer on no over¬ 
flow 

Transfer on Q plus 

Transfer on Quotient 
overflow 

Transfer on low Q 

(i = (18 l a 3 )/c) 


00 TXH 
00 TXL 

00 TIX 


00 TNX 
00 TXI 

00 TSX 
11 TRA1 

11 htr) 

11 TZE 
11 TNZ 
11 TPL 
11 TMI 
11 TOV 
11 TNO 

11 TQP 
11 TQO 

11 TLQ 



Program 2.8 Basic transfer type branches 
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(conditional) branch is made to register one. This behavior is useful in 
trace programs. The conditions for the various transfers are indicated 
in the setting of the logical variable b. Special indicators (such as the 
overflow toggle u) are reset by the transfer instructions which they control. 

Transfers based on the condition of the index accumulators (steps 1-9) 
are combined with modification of the index registers. The quantity from 
the index accumulators is again the or function of the accumulators 
selected by (18 j a 3 )/c. This quantity is compared with the decrement part 
of the command (that is, (3 j a 15 )/c)to control the conditional branches and 
is decremented or augmented by the decrement part to modify the selected 
index accumulators. 

The TSX (transfer and set index) inserts the complement (on 2 15 ) of the 
sequence vector into the selected index accumulators before effecting an 
unconditional transfer. This instruction is convenient for incorporating 
closed subroutines or other interruptions, since a subsequent TRA 
(transfer) with a zero address and indexed by the same index register 
restores the program sequence to the point of interruption. 

As shown in Program 2.7, only the TTR (trap transfer) is exempt from 
trapping. The trap indicator is set by the ETM (enter trap mode) and is 
reset by the LTM (leave trap mode) as well as by a console clear or reset. 
The XEC (execute) instruction performs no operation upon the central 
data registers but inserts in the normal instruction sequence (without 
breaking it) the instruction in the register specified by the data address 
accompanying the XEC. This is effected by simply loading the specified 
register into c and (by setting/ = 2) skipping the trap and the fetch proper 
of the instruction fetch phase. 

Logical instructions 

The logical operations (Program 2.9) concern the logical part of u, 
which differs from the numeric part by including the p -bit rather than the 
sign, and hence comprises a 2 /u. The first instruction of the family (ORS) 
produces the logical or of the word selected from storage with the vector 
a 2 /u and returns the result to the same location in storage. The instruction 
ANS (and to storage) is similar. In the ORA (or to accumulator) the 
result u is of dimension 38, and the second operand (that is, M lu>15/C ) is 
expanded to this dimension by inserting two leading zeros before oring 
it with u. The instruction ANA is similar. It is easily verified that ORA 
leaves the extra bits unchanged and that ANA resets them. The ERA 
(exclusive or to accumulator) is anomalous in that it resets the extra bits. 

The ACL (add and carry logical) is normally used only in certain parity 
check algorithms. It adds the selected word to the logical part of u , 
treating both as 36-bit positive base two numbers, again adds any resulting 
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Or to storage 
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(5 2 /u) v 
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ORA — > 
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a 2 /« — 0 

Complement magnitude 

10 

COM — > 

a 1 /u -<-a 1 /u 

Clear magnitude 

10 

CLM 

a 1 /a 0 

Change sign 

10 

CHS —> 


Set sign plus 

10 

SSP —> 

u o 0 

Set sign minus 

10 

SSM —> 

u 0 1 

Store logical word 

11 

SLW —* 

*-a 2 /u 

Clear and add logical 

11 

CAL —* 

u — a 2\M-J-“ l6 / c 

Add and carry logical 

11 

ACL — > 

y <r- l(ot 2 /«) + 




±a 2 /a <- (2 36 \y) + (y > 2 36 ) 


Program 2.9 Logical 




overflow to the low order end, and places the result (which will not exceed 
2 36 — 1) in the logical part of u. The behavior of the remaining logical 
instructions is evident from Program 2.9. As shown by the class functions 
k° and k 1 , five of them do not permit indirect addressing.* 

Arithmetic instructions 

The description of arithmetic instructions will be illustrated by the family 
of fixed pointf add instructions shown in Program 2.10. The CLA (clear 
and add) transfers the selected memory register to u , resetting the two 

* Since each of these five instructions involves the accumulator only, the normal 
address portion co 15 /c does not represent an address, and its last three components are 
used in representing the operation code itself. The possibility of indirect addressing 
would suggest to the programmer the nonexistent possibility of specifying an arbitrary 
indirect address in co 15 /c. 

f The 7090 incorporates three arithmetic functions: addition, multiplication, and 
division, each of which may be performed in either a fixed (radix) point or floating point 
mode. 
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Round 11 RND 
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Program 2.10 Add instructions 

overflow positions. The CLS (clear and subtract) differs only in that the 
sign is reversed in transfer. 

The instructions ADD, SUB, ADM, and SBM each transfer the 
selected word to d with an appropriate sign, add it to the number repre¬ 
sented by u (including the overflow positions), and place the sum reduced 
modulo 2 37 in u. The sign of a zero result is (as indicated by step 12) the 
sign of the number originally contained in u. 

The overflow indicator u is set only by a carry (but not a borrow) from 
u 3 to u 2 . This indicator controls, and is reset by, certain branch in¬ 
structions. 

The RND (round) instruction is used to round up the magnitude of the 
number represented jointly by the upper and lower accumulator by one 
unit in the high order position of the lower accumulator. As shown in 
Program 2.10, the content of the upper accumulator only is actually 
affected. 

Shift instructions 

In describing computer instructions, the term left shift of a vector x by r 
places refers either to the left rotation x r f x or to the left rotation 


« «- u 2 

±.a l /u «<- 2 37 | (|z|) 
u <- u V (0* ± ti 2 ) A (W 0 = d o )) 
M 0 ( z < 0) V («o A (Z = 0)) 


d <-( a 1 A M l0il5 / C ) 


d a 1 A M- 1<a,15 / c 


d a 1 v M^ l& l c - 6 

d u 0 a 1 v - 7 

z<-{ 1 - 2u 0 )(J_a 1 /«) + (1 - 2 d 0 )(±&ld) 8 


Clear and 
add 

11 

CLA —> 

u ( l a 2 )\M^ u l c 

Clear and 
subtract 

11 

CLS —> 

u -h-(| a 2 )\(a 1 ^ M-L“ : 
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11 

ADD—> 

d *- M-™ 1 *! 0 
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combined with a reset to zero of the last r “vacated positions,” that is, 


x (r f #) A co r . 

Both types of shift occur in the 7090 and concern various portions of the 
entire accumulator u @ /, as shown in Program 2.11. The portion affected 
is determined by the mask vector m. 


Accumulator left 


shift 
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Long left shift 
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Logical left shift 
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Program 2.11 


u u V (« r X ( m /( M © 0) 

m/(u © I) (r f (m/(u © /))) A w r 

m <- a 1 ^) © c(36) 
m — a 1 ^ 8) © a 1 (36) 
u 0 f 0 

m <- a 1 (38) © e(36) 

m <- a 1 ^) © c(36) 
m +-*\3S) ®a\36) 

*o u o 

m +- a\38) © €( 36 ) 

m/(« © Z) <- (r | (m/(u © /))) A a r 

/ f l 

( | « 2 )/« — * 

( j a 2 )/u <-0 

a 2 /u <-> / 
a 2 /u <- 0 


Shift instructions 


The first three instructions are left shifts. Each sets the accumulator 
overflow indicator if any nonzero bits are lost, i.e., if any of the first r 
positions of the affected portion are nonzero. The next three are analogous 
right shifts, which do not, however, set the overflow indicator. In the 
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“long” shifts LLS and LRS, one sign position specifies the other, although 
the sign positions are otherwise excluded from the shift by the mask m. 

The LGR shifts all positions save the sign of u\ RQL rotates MQ 
without resetting any positions; and XCA, which “exchanges” the 
accumulators, is effectively a rotation except that it resets the overflow 
bits. The amount of shift r is in each case determined by the base two 
value of co 8 / c. 

Convert instructions 

Each convert instruction (Program 2.12) selects a six-bit infix of one of the 
accumulators, adds its base two value to a “base address” initially specified 
by the address portion of the instruction, and selects the memory register 
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Program 2.12 Convert instructions 
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specified by the resulting address. Part of the selected register is used to 
respecify the base address and part to modify one or other of the accumu¬ 
lators. The process is reapplied to successive six-bit infixes in cyclic order 
a number of times determined by the base two value of (10ja 8 )/c. If 
c 20 = 1, the last fifteen bits of the last word selected in the operation are 
transferred to index accumulator 1°. 

Input-output instructions 

Because the data transmission rates of input-output equipment serving 
a computer are much lower than those of the computer proper, computer 
systems are normally designed to permit concurrent operation of the 
computer and one or more input-output units. The units are therefore 
more or less autonomous. 

In the 7090, the autonomy of input-oiitput equipment is carried further 
by providing eight data channels which transmit data to, and are controlled 
by, the computer proper, and which in turn control and accept data from 
the individual input-output units. The entire input-output process there¬ 
fore comprises three levels of microprograms: a semiautonomous input- 
output unit controlled by a semiautonomous data channel controlled by 
the computer proper. 

Attention will be restricted to the magnetic tape input-output units of 
the 7090. Each unit is available to one specific data channel i (for / = 
0 — 7), and a particular unit can be characterized as the file O/. The unit 
is completely autonomous only in certain simple operations, such as 
rewind , write end of file , backspace , and continue to end of record. Except 
for these special operations, a given data channel can control only one of 
its files at a time. The eight data channels may, however, operate con¬ 
currently under control of the computer proper. 

Each channel i behaves as a subcomputer with its own sequence vector 
S\ command vector C\ data register D\ and other miscellaneous operands, 
as shown in Table 2.13. The instructions of the subcomputer (listed in the 
matrix K ) are called channel commands and differ from the instructions of 
the computer proper in both format and function. 

Tape Units. Each tape unit behaves as a file (Sec. 1.22); each recorded 
component is an alphanumeric character represented in a seven-bit odd- 
parity error-detecting code, the lowest level partition X 0 is represented by 
the intercharacter space on the tape, the second level partition \ (called 
an end of record gap) is a longer blank space on tape. 

Each record gap is immediately preceded by a parity check character 
which is appended to the normal data of the record to permit an even 
parity “longitudinal” parity check on each of the seven-bit positions of the 



Dimension 


Channel data registers 

D 




8 x 36 

Channel sequence vectors 

S 




8 x 15 

Channel command vectors 

C 




8 x 36 



Tj : End of file X 2 


Channel trap 

T< 

ly : Parity check 

8x3 




ry : Channel command 


Channel trap enabled 

E 




8x3 

Channel trap enabled 

e 





Tape position limits 

L 

L* : (Beginning, End) 

8x2 

Limit position on tape 

V 

(Determined by reflective marker) 


Busy indicator 

b 




8 

Write or read indicator 

IV 




8 

Tape unit index 

t 




8 



0 

i : Normal read-write 


Functions 

f < 

1 

: Backspace record or write end of 




file 


8 



2 

: : Backspace to file mark 




* 

i : Rewind 


Load channel waiting 

r 

(reload) 

8 

Write record gap (Xj) next 

g 




8 

Current character 

X 

X i is the 7-bit representation 

8x7 

Current parity check 

Y 




8x7 

Interlock vector 

X 

\ 

<i = 1 if character X { is loaded 

8 

Current character selector 

V 

V i ID i is current character 

8 x 36 

End of file indicator 

Q 

Q i : (Counter, Potential error) 

8x2 

Input-output indicator 

h 








/ cd \ 

IOCD 





M 

TCH 





r p 

IORP 





r t 

IORT 


Channel commands 

K 








c p 

IOCP 





c t 

IOCT 





1 s pj 

IOSP 





\ s t / 

IOST 
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preceding record. The check character is recorded automatically and when 
read from the tape is used in the parity check but is not transmitted with 
the data of the preceding record. 

The third and highest level partition (called end of file) is represented by 
a special recorded character X 2 which has the seven-bit representation 
p(X 2 ) = (0, 0, 0, 1, 1, 1, 1). It is recorded together with the appropriate 
check character (which, since the check is of even-parity is also X 2 ) as a 
separate record. The character X 2 alone is not recognized as an end of file 
partition; only the sequence X 2 , X 2 , X x is so recognized. Tapes are 
normally stopped only at a record gap so that, on restarting, the tape is 
fully accelerated before the end of the record gap (and hence data) is 
reached. 


Dimension 


Character buffer 

Z 


Partition buffer 

P 


Logical association (connection) 

A 


Busy indicator 

B 

l 8 x # of units per channel 

Write-read status 

W 


Function status 

F 


End of file counter 

R 

(X 0 intercharacter gap 

File partitions 

A 

{Tq inter-record gap 


U 2 end-of-file symbol (0001 111) 
Table 2.14 Input-output unit operands 

The tape unit parameters are listed in Table 2.14, and the operation of 
tape unit O/ is described by Program 2.15. The unit idles at step 5 until its 
busy indicator Bf is turned on by data channel i. After a starting delay of 
about 650 microseconds required to accelerate the tape and reach the 
beginning of the record, one of four functions (listed under /in Table 2.13) 
is performed as determined by the function indicator Ff. 

If Ff = 0, a normal read or write is performed under direct control of 
the data channel as detailed in steps 18-37. If Ff ^ 0, one of the several 
completely autonomous operations is initiated. If Ff is not zero and 
Wf (write indicator) is unity, the autonomous function write end of file 
is performed by steps 1-3, after which the busy indicator is turned off and 
the unit returns to idle status. The end limit indicator Lf is set by step 3 
if the tape position exceeds a limit v set by a reflective marker attached to 
the tape a short way before its extreme end. 

If Wf = 0 and if Ff = 1, 2, or 3, the unit backspaces to the next earlier 
record gap, to the next earlier end of file position, or to position zero, 
respectively. The last is called rewind. If, in backspacing, the tape becomes 
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rewound before the appropriate partition is found, the process is terminated 
and the beginning limit indicator LJ is turned on (steps 10-11). 

The file is read backward repeatedly by step 12. When a record gap 
occurs, step 16 is executed, and if Ft = 1, the branch to step 4 returns the 
unit to idle status. If Ft = 3, termination can occur only from step 11, at 
which point the tape is rewound. The counter Rt is used to detect an end 
of file partition. It is reduced by one (step 14) if the character read isX 2 or 
to zero if it is not. Since Rt is set to 3 after each record gap, step 17 is 
reached with Rt = 1 if and only if the end of file sequence X l5 X 2 , X 2 , X x has 
occurred. 



Before completing the discussion of the remaining functions of the tape 
unit, it may be helpful to follow through the entire process initiated by the 
BSR (back space record) instruction. The channel idles (Program 2.16) on 
step 5 with the busy indicator b { off. The BSR instruction (Program 2.17) 
first determines the index i of the channel addressed, waits on step 6 until 
the selected channel becomes free, sets the tape index t t — j to select the 
particular unit O/, the function indicator f i to unity, the write indicator w i 
to zero, and the busy indicator b i to unity. This last action initiates oper¬ 
ation of channel i, which, as soon as unit O/ becomes free (Bt = 0), executes 
steps 6-7 and then (since f i > 0) returns the channel immediately to idle 
status. Step 6 transfers to the parameters of the selected unit the relevant 
channel parameters which were themselves specified by the BSR instruction. 
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Step 6 also makes the selected unit busy (£/ = 1), and hence starts it in 
operation. 

The normal read-write functions of the tape unit are described by steps 
18-37. They are initiated by a branch from step 7 in the event that the 
function indicator is zero. Over-all control by the channel is exercised 
primarily through the association indicator Ap, which is equal to unity if 
unit j is logically associated with channel i, that is, if data are permitted to 
flow between them. If writing is in progress (Wp = 1) and A/ becomes 
zero, the unit stops; if reading is in progress, the channel is freed immedi¬ 
ately but the tape continues to a record gap. 

Step 20 performs the read from tape, and if A/ = 1, the partition read 
is transferred to the seven-bit representation of the character read is 
transferred to X 1 (both for use by the channel), and the channel-unit 
interlock x { is set to unity to initiate appropriate disposition of the 
character by the channel. If Pp is not a record gap reading continues, the 
intercharacter delay (not shown) permitting time for the channel to 
dispose of the character before the next is actually read. If Pp is a record 
gap, the corresponding delay elapses before Ap is tested. If Ap = 0, 
the branch to step 4 stops the unit. The tape stops only at a record 
gap although transmission of data may be discontinued earlier by step 
21 . 

The writing process begins at step 37 and may be discontinued before 
any writing occurs (although the current record gap will be lengthened by 
a few inches of blank tape). The main writing is performed by the loop 
28-32, employing the channel interlock Step 31 sets the tape end limit 
indicator. The loop terminates (step 32) when the write record gap 
indicator g { is set to unity by the channel. Steps 33-36 then write the 
longitudinal parity check character Y i supplied by the channel, together 
with the inter-record gap partition The write loop is then re-entered 
unless Ap = 0. 

Channel operation. Operation of a channel is initiated either by one of the 
special functions (WEF, BSR, BSF, REW) already described, or by a 
WRS (write select), or an RDS (read select). The loading of the channel 
command C i required to control the two latter functions is, however, 
controlled by a subsequent RCH (reset load channel), which transfers to 
S* the address in memory of the desired channel command. 

The WRS (Program 2.18) selects the channel / specified by a portion of 
its address, waits until the channel is free, sets its tape unit index t i as 
specified by another portion of the address, sets the write indicator to 
unity and the function indicator f t to zero, and, finally, sets to start the 
channel. The fetch mode indicator / is also set to one so as to skip the 
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channel trap on the next instruction fetch. This prevents a trap from 
intervening between the WRS and the following instruction (which is 
normally an RCH). The RDS differs only in the setting of w i on step 4. 

If the channel is not busy (i.e., not selected), the RCH instruction 
(Program 2.18) selects the channel specified by a portion of the operation 
code, sets the input-output indicator h , and copies the instruction address 
to the channel sequence vector S 7 . If the channel is busy, the RCH 
instruction sets the selected channel to its step 9, whereupon the channel 
waits on the interlock at step 10. Meanwhile, step 18 of the RCH sets 
S 7 and step 19 sets the interlock r i so that the channel may proceed. 

Steps 13 and 14 of the channel operation load the channel command 
register C 7 and increment the channel sequence register S 7 . If the command 
is a TCH (Transfer in Channel), step 16 causes a branch to a new sequence 
of commands. If not, the word count , represented by (3 [ a 15 )/C*‘, is 
tested. If it is zero and if the current command is either an IOSP or IOCP, 
the branch to step 13 immediately fetches the next command in sequence. 
Otherwise, indirect addressing of the command occurs (step 19) unless 
C \ 8 is zero. 

Step 20 specifies k according to the class of the command being executed. 
The commands are listed in the matrix K of Table 2.13. 

The first component Kf assumes the value c, r, or s according as the 
command having code i is terminated by a word count test, a record gap, 
or by either (signal). The second component Kf assumes the value d, p, 
or t according as the channel discontinues operation, proceeds to the next 
command in sequence (as determined by S *), or transfers to an LCH 
(Load Channel) instruction which may be awaiting execution by the 
computer proper. Execution of the LCH (Program 2.18) is delayed at 
step 11 and branches to step 18 (to respecify S 7 in the manner of the RCH) 
only if the channel reaches step 3. 

Channel operation continues on the right-hand segment (steps 35-65) if 
the operation is a read (w i = 0), and on the left (steps 22-34) if it is a 
write. In the latter case, a zero word count causes immediate termination 
of the current command. 

The normal termination of a command in either read or write mode 
occasions a branch to step 1, where the tests for continuation begin. Step 
1 sets the Channel Command Trap indicator T 2 7 if the current command is 
of the transfer type and an LCH (Load Channel) is not awaiting execution 
in the computer proper. If the command is of the proceed type, step 2 
branches to step 13, where the next command in sequence is fetched. If the 
command is of the transfer type and an LCH is waiting = 1), step 3 
branches to step 9 to reset parameters and permit the channel to be reloaded. 
In all other circumstances step 4 is executed to disassociate the unit from the 
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channel and to return the channel to idle status. In read status, certain 
abnormal events—the occurrence of a parity error, or an end of file 
partition—return the channel to idle status immediately, regardless of the 
type of command being executed. 

The write operation (steps 22-34) is relatively simple. If the word 
count in (3 [ a 15 )/C* is zero, steps 23-24 terminate the current command 
but first initiate the writing of an end of record gap* if the command is of 
the “record” type (e.g., an IORP). If the word count is not zero, step 25 
transfers to the channel data register D l the memory word selected by the 
address portion of the command. The loop 28-33 transfers to the tape 
unit successive six-bit infixes of D l and maintains the longitudinal parity 
check Y i (originally reset on step 12). When all six have been transferred, 
the branch to step 34 decrements the word count and, unless it becomes 
zero, repeats the entire process from step 25. 

The read operation (steps 35-65) begins by resetting D l to zero and the 
infix selector V i to a 6 . Step 37 terminates the current command if it is of 
the count or signal type and the word count is zero. Steps 38-39 terminate 
the command if it is of the record or signal type and if the last file partition 
read is a record gap. The partition indicator p t is reset to X 0 by step 39. 
Thus a record gap present when termination is caused by a zero count is 
still present on the first execution of the succeeding command, whereas a 
gap which itself causes termination is not present on the succeeding 
command. 

Steps 40-43 show the data interlock, the determination of the longi¬ 
tudinal parity check, and the setting of the parity error trap Tp in the 
event of a parity error in the character. If the corresponding channel trap 
is enabled, step 44 causes immediate termination in the event of a parity 
error. Steps 45-48 detect an end of file configuration (using a counter, 
Oo l in a manner similar to that used in Program 2.15), set the indicator 
Oi if a partition character X 2 appears at the beginning of a word, and 
cause termination (from step 49) with the end of file trap T 0 * set if an end 
of file configuration occurs. If the character X 2 occurring at the beginning 
of a word is not part of an end of file configuration, step 50 sets the tape 
error trap T x \ and step 51 causes termination if the corresponding channel 
trap is enabled. 

Steps 53-56 are executed only if p { is a record gap. They reset the 
counters Q l controlling the end of file test, test and reset the longitudinal 
parity vector Y\ and may cause termination in the event of an error. Step 
57 causes the character transfer of step 58 to be skipped if the character is 

* Since the partition is represented by a gap, the writing of one gap immediately 
following another, with no intervening data, has the effect (when subsequently read) of a 
single record gap. 
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Input-output test 


Beginning of tape test 
End of tape test 


Transfer on channel 
end of file 


Transfer on channel 
redundancy 


Transfer on channel 
in operation 

Transfer on channel 
not in operation 



Program 2.19 Input-output branch operations 
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a potential end of file. Steps 62-65 decrement the word count and transfer 
completed words to successive memory locations unless Cj 9 is zero. Step 
61 suspends these operations when the word count reaches zero. Since 
step 56 is followed by step 61, the occurrence of a record gap occasions the 
(potential) transfer of a word to memory even though it is incomplete. 
Because of the reset of D 1 on step 35, the incompleted part of the word is 
zero. 

Auxiliary channel instructions. Program 2.19 shows those branch instruc¬ 
tions which are controlled by indicators associated with the data channels. 
Each indicator tested is also reset. The last four instructions shown are 
subject to the trap mode. 



Program 2.20 Trap control and store channel 


The channel indicators T may also cause interruptions as detailed in the 
instruction fetch phase. They are controlled by the enable matrix E and 
the enable trigger e which are set by the ENB (enable) and RCT (reset 
traps) instruction of Program 2.20. The instruction SCH (Program 2.20) 
permits storage of the channel registers. 
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2.3 DETAILED LOGICAL DESIGN 

Although a description couched at the programmer’s level specifies 
completely the functions of a computer, it requires considerable extension 
to provide a basis for the so-called logical design of circuits for realizing 
the computer. The extensions include: (1) the specification of sequence in 
the microprograms themselves; (2) further detailing of certain complex 
functions; (3) reduction of the number of operands (registers) required; 
and (4) economization in the underlying functions provided. The nature 
of these extensions will be indicated briefly. 

In principle, the problem of sequence control in the microprograms does 
not differ from the sequence control in computer programs. However, the 
function served by the sequence vector s (a base two representation of the 
address of the succeeding instruction) is frequently served instead by a ring 
or combination of rings. A ring is a logical vector r of weight one (that is, 
+/r = 1) capable of rotation (jr or |r) and of resetting to one of several 
initial positions p t (that is, r e Pi .) 

Certain steps of a microprogram, which at the programmer’s level may 
be considered as monolithic, must themselves be realized in circuitry as 
more detailed microprograms. The addition of two unsigned (positive) 
numbers represented in base two by the vectors x and y might, for example, 
be performed as in Program 2.21. The result x produced is correct only if 
the sum is less than 2 V{X) . 

Economization in the underlying functions provided is achieved by 
restricting the “data paths” provided between the various operands (i.e., 
registers) and by restricting the operands to which certain operations 
apply. Restriction of data paths implies that it is not possible for each 
operand to specify every other operand directly. For example, memory 
may be restricted to communicate only with the buffer register d so that 
any transfer from memory such as 

c<-M^ s 


y : 0 

z y 

y *- t (* A z) 

X <- (x ^ z) 


Program 2.21 Base two addition 
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Figure 2.22 Data paths 


must in fact be performed in two steps: 

d M ls 
c <- d. 

An operation such as address decoding (i.e., conversion of the normal 
base two representation of an address / into a one-out-of-fl code of the 
form e* suitable for selecting word i from memory) is relatively costly and 
is not normally provided for all relevant operands (such as s and c o 15 /c in 
the 7090). Instead, decoding may be provided on a single auxiliary 
operand a; the selection of an instruction in the 7090 would then be 
executed in two steps: 

a+- s 
c M la . 

All microprograms specified at the programmer’s level must, of course, 
be translated into equivalent microprograms which satisfy the path 
constraints. Path restrictions are perhaps best displayed as a “block 
diagram” showing the data paths provided between the various registers 
and operations units. Figure 2.22 illustrates a convenient representation in 
which the paths are shown as connecting lines with arrowheads indicating 
the possible directions of transfer. If the indicated paths are further re¬ 
stricted to selected components of the operands, this restriction may be 
indicated by a pair of selection vectors separated by a colon. Thus the 
notation 

co 15 (36) : e(15) 

on the path between d and s of Fig. 2.22 indicates that transfers occur 
between co 15 /d and s. The symbols r and c denote the selection of a matrix 
row and a matrix column, respectively, as illustrated by the path between 
M and d. Permutations may be represented in the form pj. Thus if the 
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vector d were to be transposed (reversed in order) in the transfer to c, the 
path would be labeled with the expression 

Pj : €, 

where p = |i~ 35 (36)| = (35, 34, . . ., 1, 0). 
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EXERCISES 

2.1 Write 7090 programs for each of the following operations: 

(a) M^M^vM 9 

(b) M8 <- M8 

MM 6 2 36 | (J_M8 + _L M») 

^ [ A- M1 *- ((J_M8 + J_M 9 ) > 2 36 ) 

(d) M« /M 7 , a 6 , M 9 / 

(e) Af« /M 7 , to«, M 9 / 

(f) M 6 /M 7 , (8 J. a 10 ), M 9 / 

(g) M 6 <- /M 7 , M 8 , M 9 / [Use /*, a, y/ = (x A «) V O A «)] 

(h) M« — tijoi 1 

2.2 In the magnetic core technology employed in the 7090, logical disjunction 
(or) and negation are much easier to produce than conjunction (and). Limiting 
the logical functions employed to disjunction and negation, write microprograms 
for the following 7090 instructions: 

(a) ANS (Use De Morgan’s law, Sec. 1.8) 

(b) ERA 

2.3 In the magnetic core technology used in the 7090, each transfer of a quantity 
y into a register x is actually an or with the present content of the register, i.e., 
x +-y v x. A register may also be reset to zero. Subject to the foregoing 
restriction, write microprograms for 

(a) the operation l *-M\ (Use two steps.) 

(b) the operations of Exercise 2.2(a). 

2.4 Describe the main portion of the instruction fetch of the 7090 (steps 9-18 
of Program 2.4) in an algorithm which satisfies the data path constraints of Fig. 
2.22. 

2.5 Repeat Exercise 2.4 so as to satisfy the constraints of Exercises 2.2 and 2.3 
as well. 

2.6 A vector p which permits only the following types of operation: 

(0 P *- \P, w 1 , *\ 
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is called a pushdown or stack vector. If the successive components of a stack 
vector p are represented by successive memory registers, then the operations 
affecting it can be controlled by a single address counter v , which is automatically 
incremented at each operation of type (i) (addition of a new final component) and 
is automatically decremented at each operation of type (ii) (reference to the final 
component accompanied by its deletion). 

(a) Using the 7090 registers and formats augmented by an address counter v 
of dimension 15, write a microprogram for an operation LDS (load stack) 
which transfers M- 1 ™ 15 ! 0 to the top of the stack [operation type (i)]. 

(b) Write a microprogram for STS (store stack) which transfers the top of the 
stack to M lwl5 / C [operation type (ii)]. 

(c) Write a microprogram for an operation AND which produces the and 
function of the top two components of the stack, deletes them, and appends 
the result as a new final component. [The net reduction in v(p) is one]. 

(d) The AND of part (c) has no associated address. Show that all 7090 
instructions (other than input-output) can be redefined so that only the 
LDS and STS require associated addresses. 

2.7 In the 7090, a decimal digit x is represented in direct binary coding in a 
six-bit logical vector x, (that is, _|_* = x ), and each register accommodates six 
decimal digits. Use the convert instructions (Program 2.12) in a 7090 program to 

(a) convert from binary to decimal. 

(b) convert from decimal to binary. 

(c) replace all leading zeros (i.e., all preceding the first significant digit) of a 
number represented in decimal. 

2.8 Write a 7090 program to convert 

(a) from a 36-bit binary code to a 36-bit reflected Gray code [see Phister (1958)]. 

(b) from a reflected Gray code to binary. 

2.9 A memory M is called a tag or associative memory if for any argument x 
it yields a direct indication of the row or rows of M which agree with x. If the 
resulting indication is in the form of a vector s such that the matrix s//M contains 
the indicated rows, then s = M ^ x. More generally, a logical mask vector m is 
added to the system so that m/M is compared with the argument mjx and some 
desired function of m/M fc is represented by m/M^ for each k. In the following 
exercises M is assumed to be a logical matrix. 

(a) Use De Morgan’s laws (Secs. 1.8 and 1.11 or Sec. 7.1) to derive from the 
relation s = M £ x an expression for s which would be suited to a circuit 
technology in which disjunction and negation are easier to perform than 
conjunction. 

(b) Write a detailed algorithm using a row-by-row scan of M to determine 
s = (m/M) a (mix). 

(c) Repeat part (b) using a column-by-column scan of M. 

(d) Use a column-by-column scan of M to determine s such that s//M contains 
the rows of M of maximum base two value [see Falkoff (1962)]. 
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VARIABLES 


3.1 ALLOCATION AND ENCODING 

Although the abstract description of a program may be presented in any 
suitable language, its automatic execution must be performed on some 
specified representation of the relevant operands. The specification of this 
representation presents two distinct aspects—allocation and encoding. 

An allocation specifies the correspondences between physical devices and 
the variables represented thereby. An encoding specifies the correspond¬ 
ences between the distinct states of the physical devices and the literals 
which they represent. If, for example, certain numerical data are to be 
represented by a set of 50 two-state devices, the two-out-of-five coding 
system of Exercise 1.6 might be chosen, and it would then remain to 
specify the allocation. The two-digit quantity “hours worked” might be 
allocated as follows: devices 31-35 represent components 1-5, respectively, 
of the first digit, and devices 29, 16, 17, 24, and 47 represent components 
1, 2, 3, 4, 5, respectively, of the second digit. 

The encoding of a variable will be specified by an encoding matrix C and 
associated format vector f such that the rows of f\C list the representands 
and the rows of f/C list the corresponding representations. The encoding 
is normally fixed and normally concerns the programmer only in the 
translation of input or output data. Even this translation is usually 
handled in a routine manner, and attention will therefore be restricted 
primarily to the problem of allocation. 

However, the encoding of numeric quantities warrants special comment. 
It includes the representation of the sign and of the scale, as well as the 
representation of the significant digits. Small numbers, such as indices, ad¬ 
mit not only of the usual positional representation but also of the use of the 
unit vector e j to represent the number j (i.e., a one-out-of-A7 coding system), 
or of the use of a logical vector of weight j (i.e., a base 1 number system). 

Allocation will be described in terms of the physical vector 7i, which 
denotes the physical storage elements of the computer. Each component 
of 7i corresponds to one of the v(n) similar physical devices available, its 
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range of values is the set of physical states achievable by each device, and 
its index is the address of the device. Each component of n may corre¬ 
spond to a computer register, an individual character position in a register, 
or an individual binary digit within a character, depending on the degree of 
resolution appropriate to the allocation problem considered. The 0-origin 
indexing normally used for computer addresses will be used for the physical 
vector, but 1-origin indexing will, throughout this chapter, normally be 
employed for all other structured operands. 

An index of the physical vector will be called an address and will itself 
be represented in the (perhaps mixed) radix appropriate to the given com¬ 
puter. The Univac, for example, employs base ten addressing for the 
registers, and (because of the use of 12-character words) a radix of twelve 
for finer resolution. The address of the fourth character of register 675 
might therefore be written as 675.3. In computers which have two or 
more independent addressing systems (e.g., the independent addressing 
systems for main memory and for auxiliary storage in the IBM 705), 
superscripts may be used to identify the several physical vectors n j . 

In general, the representation of a quantity a; is a vector (to be denoted 
by p(#)) whose components are chosen from the physical vector n. Thus 
p(x) = k$n, where k is a mapping vector associated with x. The dimension 
of the representation (that is, r(p(z))) is called the dimension of x in n. If, 
for example, p(a?) = (7r 10 , 7i 9 , 7i 17 , 7 t 18 ), then k = (10, 9, 17, 18), and the 
dimension of x in n is four. If p(x) is an infix of 7r, then the representation 
of x is said to be solid. A solid representation can be characterized by two 
parameters, its dimension d and its leading address f that is, the index in7i 
of its first component. Then p(a?) = (/ j a d )/7r. 

3.2 REPRESENTATION OF STRUCTURED 
OPERANDS 

The grid matrix 

If each component of a vector x has a solid representation, then the 
representation of the entire vector is said to be solid and may be charac¬ 
terized by the grid matrix T(x), of dimension v(x) x 2, defined as follows: 
IY(tf) is the leading address of p(^), and T 2 i (x) is the dimension of x { in n. 
If, for example, the vector x is represented as shown in Fig. 3.1 a, then 

In 2\ 

19 4 
27 5 . 

23 1 
\32 3/ 


r(*) = 



Physical vector 
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Figure 3.2 Linear representation of a matrix X 
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Any structured operand can first be reduced to an equivalent vector, and 
the grid matrix therefore suffices for describing the representation of any 
construct, providing only that the representation of each of its elements is 
solid. Thus a matrix X may be represented by either the row-by-row list 
r = E/X or the column-by-column list c = E//X, and a tree T may be 
represented by the left list matrix [T or the right list matrix ]T, either of 
which may be represented, in turn, by a vector. 

If a process involves only a small number of variables, it is practical to 
make their allocation implicit in the algorithm, i.e., to incorporate in the 
algorithm the selection operations on the vector n necessary to extract the 
appropriate variables. This is the procedure usually employed, for 
example, in simple computer programs. In processes involving numerous 
variables, implicit allocation may become too cumbersome and confusing, 
and more systematic procedures are needed. 

Linear representations 

The representation of a structured operand is said to be linear if each 
component is represented by an infix of the form (/ { a d )/n , where / is a 
linear function of the indices of the component. For example, the 
representation of the matrix X indicated by Fig. 3.2 is linear, with d = 2 
and / = — 11 + 5/ + 8/ 

A linear representation is solid and can clearly be characterized by a 
small number of parameters—the dimension d of each component and the 
coefficients in the linear expression /. The representation of a vector x is 
linear if and only if r 2 (*) = de and the difference S = IY(#) — Pi _1 (#) is 
constant for / = 2, 3, ..., v(x). 

If / = p + qi + rj is the function defining a linear representation of a 
matrix X and if a is the leading address of a given element, then the leading 
address of the succeeding element in the row (or column) is simply a + r 
(or a + q). Frequently, the succession must be cyclic, and the resulting 
sum must be reduced modulo v(X) x r (or /u(X) x q ). The inherent 
convenience of linear representations is further enhanced by index registers, 
which provide efficient incrementation and comparison of addresses. 

Linear representation of a structured operand requires that all com¬ 
ponents be of the same dimension in n. This common dimension may, 
however, be achieved by appending null elements to the shorter com¬ 
ponents. The convenience of the linear representation must then be 
weighed against the waste occasioned by the null elements. Moreover, if 
several vectors or matrices are to be represented and if each is of unspecified 
total dimension in 7i, it may be impossible to allot to each an infix suffi¬ 
ciently large to permit linear representation. Consequently, a linear 
representation is not always practicable. 
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Nonlinear representations 

Since the use of the grid matrix imposes only the condition of solidity 
for each component, it permits an allocation which is sufficiently general 
for most purposes. The grid matrix serves in two distinct capacities: (1) 
as a useful conceptual device for describing an allocation even when the 
actual allocation is implicit in the program, and (2) as a parameter 
which enters directly into an algorithm and explicitly specifies the allo¬ 
cation. 

If the grid matrix is used in a program as an explicit specification of the 
allocation, then the grid matrix must itself be represented by the physical 
vector. There remains, therefore, the problem of choosing a suitable 
allocation for the grid matrix itself; a linear allocation is illustrated by 
Fig. 3.16. 

If the grid matrix T(^) itself employs a linear representation, its use 
offers advantages over the direct use of a linear representation of x only if 
the total dimension of T in n is much less than the total dimension of x in 
7i when linear representations are employed for both. This is frequently 
the case, since each element of a grid matrix belongs to the index set of n 
(that is, to i°(r(7r))), and the dimension of each element in n is therefore 
both uniform and relatively small. Program 3.3 shows the use of the grid 
matrix r(#) and the encoding matrix C in determining the kth component 
of the vector x. 

Program 3.3. A linear representation is assumed for T(x), with element r/(*) 
represented by the infix ((p 4- qi 4- rj) [ cL g )ln. Moreover, each element of r(*) 
is assumed to be represented in a base b number system. Step 1 determines the 
leading address of the representation of r^*). Step 2 specifies f as the base b 
value of this representation, i.e., as the leading address of p(* fc ). Steps 3 and 4 
specify d as the dimension of x k in n, and step 5 therefore specifies z as the 
representation of x k . 

Steps 7-9 perform the decoding of z = $(x k ) to obtain 2 as the actual value of 
x k . Since this process is normally performed by human or mechanical means 
(e.g., a printer) outside the purview of the programmer, it is here expressed directly 
in terms of the encoding matrix C rather than in terms of its representation. The 
left-pointing exit on step 7 is followed only if z does not occur as an entry in the 
encoding matrix. 

The form chosen for the grid matrix is one of several possible. The two 
columns could, for example, represent the leading and final addresses of 
the corresponding representations or the dimensions and final addresses. 
The present choice of leading address / and dimension d is, however, the 
most convenient for use in conjunction with the notation adopted for 
infixes; the logical vector (/ j a d ) selects the appropriate infix. 
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0-origin indexing for n only 

p, q,r 

Constant, coefficient of row 
index, and coefficient of column 
index in the linear function for 
the representation of T(x). 

b 

Base used in representing ele¬ 
ments of T(x). 

s 

Dimension in n of each element 
of T(x). 

f 

Leading address of p(^). 

d 

Dimension of p(* fc ) in tt. 

z 


c 

Encoding matrix for com¬ 
ponents of x. 

f 

Format vector for C. 

z 

Character encoded by x k . 


1 

+-p + qk + r x 1 

/ 

— b-!-((/ j a 9 )In) 

l 

*-1 + r 

d 

+-b±((H aP)!n) 

z 

+-(fla d )ln 

h 

-MO + l 

h 

*-h - 1 

z 

: flC h 

z 



Legend 

Program 3.3 Determination of z = p(* fc ) and z = x k from a linear representa¬ 
tion of the grid matrix r(*) 

Chained representations* 

If a linear representation is used for a vector, then the deletion of a 
component (as in a compress operation) necessitates the moving (i.e., 
respecification) of the representations of each of the subsequent com¬ 
ponents. Similarly, mesh operations (insertion) and permutations 
necessitate extensive respecification. The use of a grid matrix T(x) 
obviates such respecification in x , since appropriate changes can instead be 
made in I\*), where they may be much simpler to effect. If, for example, 
x is the vector represented as in Fig. 3.1 a, and z is a quantity of dimension 
six in 7i, then the mesh operation 

* \*, € 3 , Z\ 

may be effected by specifying the physical infix (70 j a 6 )/rc by p(z) and by 

* Chained representations have received extensive treatment, frequently under the 
name “lists.” See, for example, Shaw et al. (1958) and Blaauw (1959). 
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respecifying T(x) as follows: 


Representation of structured operands 111 


F 2 \ 

19 4 


r(*) = 


70 

27 


23 1 
\ 32 3 / 


However, if the representation of I\#) is itself linear, then insertions, 
deletions, and permutations in x will occasion changes in all components of 
r(#) whose indices are affected. The need for a linear representation of the 
grid matrix (and hence for all linear representations) can be obviated by 
the use of a chained representation defined as follows. 

Consider a vector y, each of whose components y k has a solid represen¬ 
tation p (y k ) whose infixes (g l 0 L g )/p(y k ) and OL 9 lp(y k ) are, respectively, 
the dimension of p(y k ) in n and the leading address of the representation of 
the (cyclically) succeeding component of y (both in a base b system), and 
whose suffix a? 9 lp(y k ) is the representation of the kth component of some 
vector x. Then (the representation of) y is called a chained representation 
of x. In other words, the representation of y incorporates its own grid 
matrix (with the address column Tfy) rotated upward by one place) as 
well as the representation of the vector x. 

For example, if g = 2, b = 10e, and x = (365, 7, 24), then 


P(3h) — (*17> **i 8 > ^19’ ^ 20 ’ ^ 2 i> ^ 22 ’ ^ 23 ) — ( 6 > 8 , 0, 7, 3, 6, 5), 
pO^) = (7*68» ^69’ ^70> 7Z 7l’> ^ 72 ) = (2> 6 , 0, 5, 7), 

and 

P(^ 3 ) = ( Tr 26’ ^27’ ^28’ ^29’ ^30* ^Sl) = (1 j 6, 2, 4), 


is a suitable chained representation of x. 

The parameters required in executing an algorithm on a chained 
representation y are g, the common dimension in n of the elements of the 
grid matrix T(y); b , the base of the number system employed in their 
representation; and/and h, the leading address and index, respectively, 
of the representation of some one component of y. The parameters g and 
b are usually common to the entire set of chained representations in use. 
Program 3.4 illustrates the type of algorithm required to determine p(tf fc ) 
from a given chained representation of x. 

Program 3.4. The loop (1-3) is executed v(x) | 0 (A; — h) times, with the result 
that at step 4 the parameter/is the leading address of pCy fc ). Step 4 therefore 
specifies d as the dimension of p(>^), that is, as the base b value of iy/y). Step 5 
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1 

h - V(x) li (h + 1) 


2 

3 ► 

/<-6±((/J a. 0 ) In) 

h : k 


4 

d+-b±(( f +g)l a«)/n 


5 

z — (/J. <**)/« 


6 

P(*fc) — « 25 /2 



0-origin indexing for n only 

hj 

f is the leading address of the 
hi h component of the chained 
representation of x. 

b 

Base used for representation 
of the elements of the grid 
matrix. 

8 

Dimension in n of elements of 
the grid matrix. 

d 

Dimension in n of Arth com¬ 
ponent of the chained represen¬ 
tation of x. 

z 

Acth component of the chained 
representation of x. 


Legend 

Program 3.4 Determination of p(* fc ) from a chained representation of x 

then specifies z as pO^). Step 6 deletes those components of z which represent 
the elements of the grid matrix, leaving p(x/). 

The parameters / and h are themselves respecified in the execution of the 
algorithm so that h becomes k and/becomes, appropriately, the leading address 
of pCy /c ). A subsequent execution then begins from this new initial condition. 

The chained representation used thus far is cyclic and contains no 
internal identification of the first or the last components. Such an identi¬ 
fication can be incorporated by adding a null component between the last 
and first components of x. Alternatively the identification may be achieved 
without augmenting the dimension but by sacrificing the end-around 
chaining, i.e., by replacing the last component of f Tfy) by a null element. 
Moreover, a chained representation may be entered (i.e., the scan may be 
begun) at any one of several points, provided only that the index h and 
corresponding leading address / are known for each of the points. 

The number of components of a chained representation scanned (steps 
1-3 of Program 3.4) in selecting the kth component of x is given by 
v(x) | 0 (k — h), where h is the index of the component last selected. The 
selection operation is therefore most efficient when the components are 
selected in ascending order on the index. The chaining is effective in the 
forward direction only, and the component (/? -— 1) would be obtained 
only by a complete cyclic forward scan of v(x) — 1 components. The 
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representation is therefore called a forward chain. A backward chain can 
be formed by incorporating the vector jIVy) instead of jT^y), and a 
double chain results from incorporating both. 

A vector x which is respecified only by either deleting the final com¬ 
ponent or by adding a new final component (i.e., by operations of the 
form x co 1 / x , or x +- x © (z)) behaves as a stack (cf. Exercise 2.6). A 
backward-chained representation is clearly convenient for such a stack. 

A simple example of the use of a chained stack occurs in representing 
the available (i.e., unused) segments of the physical vector n. This will be 
illustrated by a program for the vector compression 

X v/x 

executed on a forward-chained representation of x. The unused segments 
representing the components of v/x are returned to a backward-chained 
stack or pool of available components. A linear representation can usually 
be used for logical control vectors such as y; in any case the problems 
involved in their representation are relatively trivial and will be subordi¬ 
nated by expressing each operation directly in terms of the logical vectors 
and not in terms of the physical components representing them. 

Program 3.5. In the major loop (6-23), k determines the index of the current 
component v k , and / and j determine the leading addresses of p(^ A .) and p(*a:+i)> 
respectively. These three parameters are cycled through successive values by 
steps 7, 8, and 12 and are initialized by steps 2, 5, and 12. If v k = 0, the infix 
p(* fc ) is returned to the pool by steps 21, 22, 23, and 6 so as to construct a back¬ 
ward chain. 

The parameter x specifies the leading address of p(tfj) unless v(x) = 0, in which 
case x is null. Step 1 terminates the process if r(*) = 0, and otherwise step 4 
respecifies x as the null element. If v =0, this null value of x remains; if not, the 
first nonzero component of v causes a branch to step 14. Since x = o, step 15 is 
executed to respecify x as the leading address of p((v/^) 1 ). Step 16 then specifies //, 
the leading address of the last completed component of v/x. Step 15 is never 
again executed. 

Components of v/x other than the first must each be chained (in a forward 
chain) to the preceding one. Hence the leading address / of a newly added com¬ 
ponent must be inserted in the last preceding component (whose leading address 
is h ). This is normally done by steps 18, 19, and 6; step 20 respecifies h. If, how¬ 
ever, the component x k _ Y were also included, it would appear as the last com¬ 
pleted component of v/x and would already be chained to the new component x k . 
This situation is recognized by step 17 and occasions a branch to step 16. Step 16 
then respecifies h and repeats the loop without executing steps 18, 19, and 6. 

The process terminates when the cycle through the chained representation of x 
is completed, that is, when / returns to the original value of x, preserved as / by 
step 3. Step 10 is then executed, terminating the process directly if v(vjx) = 0. 
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Otherwise, step 11 is executed to close the chain of v/x, that is, to insert x, the 
leading address of p((r/^) x ), in the representation of the last component of v/x. 



0-origin indexing for n only 

X 

Leading address of p(^) 
if v(x) > 0; otherwise 

X — o 

V 

Logical vector. 

k 

Index of v. 

i 

Leading address of p(* fc ). 

j 

Leading address of p(x k+1 ). 

h 

Leading address of last 
preceding component of 
v/x. 

P 

Leading address of last 
preceding component of 
pool of available seg¬ 
ments. 

g 

Dimension in n of ele¬ 
ments of grid matrices. 

b 

Base of representation of 
elements of grid matrices. 


Legend 


Program 3.5 Program for x v/x on a forward chained representation of x 
and a backward chained stack of available segments 


A chained representation can be generalized to allow the direct represen¬ 
tation of more complex constructs, such as trees, by incorporating the 
address of each of the successor components associated with a given 





§3.2 Representation of structured operands 115 

component. This notion is formalized in the chain list matrix of Sec. 3.4. 
The same scheme can also be employed to produce an efficient combined 
representation of two or more vectors which share certain common 
components. If, for example, x 5 = z k9 and chained representations are 
used for both x and z, then x may be represented in standard form except 
that component x j incorporates a secondary address, which is the leading 
address of z k+v Moreover z has a standard representation except that 
z k _ x is chained to x j9 with an indicator to show that the secondary address 
of the succeeding component is to be used. Deletion of any vector 
component in such a shared system must occasion only the corresponding 
change in the address chain of the vector, the actual representation of the 
component being deleted only when no associated address remains. 

Partitions 

If the set a is the range of the components of the physical vector n, and 
if some element, say a l9 is reserved as a partition symbol and is excluded 
from use in the normal representation of quantities, it can be inserted to 
demark the end (or beginning) of an infix of n. If the vector y is repre¬ 
sented by a single infix of n such that the beginning of component y j+1 
follows immediately after the terminal partition of y j9 then the structure of 
y is completely represented by the partitions, and y is called a partitioned 
representation. A partitioned representation can be used for more complex 
operands, such as matrices, if a set of two or more distinct partition 
symbols are provided, one for each level of structure. The distinct 
partition symbols can, of course, be represented by multiple occurrences of 
a single symbol a x rather than by distinct members of a. 

A partitioned representation is similar to a double-chained representa¬ 
tion without end-around chaining in the following particular: beginning 
from component y i9 the component y i can be reached only by scanning 
all intervening components between / and j in increasing or decreasing 
order according as i < j or i > j. The file notation introduced in Sec. 
1.22 clearly provides the operations appropriate to a partitioned repre¬ 
sentation of a vector, with conventions which suppress all inessential 
references to the partitions themselves. 

The use of a partition to demark the end of an infix is particularly 
convenient when the infix must be processed component by component 
for other reasons, as in the use of magnetic tape or other serial storage. 
The partition also appears to be more economical than the grid matrix, 
which it replaces. This apparent economy is, however, somewhat illusory, 
since the reservation of a special partition symbol reduces the information 
content of each nonpartition component by the factor log 2 (v(a) — 1) 
log 2 v(a) 9 where a is the range of the components of n. 
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Partitions can be employed in chained representations. For example, 
the dimension in n of each component of a chained representation y can be 
specified implicitly by terminal partitions instead of explicitly by the vector 
r 2 Cy) of the grid matrix. Thus if the elements of Tfy) are of dimension g 
in 7T, then co 1 /? (y,) = a l9 and (a 9 A ZP)/p (y 3 ) = p(^), where * is the 
vector represented by y. Program 3.6 shows the determination of pO fc ) 
from a chained representation y with terminal partitions a v 



0-origin indexing for n only 

*./ 

/is the leading address of the 
hi h component of the chained 
representation of x. 

b 

Base used for representation of 
the elements of the grid matrix. 

g 

Dimension in n of the elements 
of the grid matrix. 

a i 

Partition symbol. 

z 

&th component of the chained 
representation of x exclusive of 
the terminal partition symbol. 

d 

Dimension of z in n. 



Legend 

Program 3.6 Determination of p(* fc ) from a chained representation of x with 
terminal partitions cq 

Program 3.6. The program is similar to Program 3.4 and the step numbering 
indicates the correspondences. The dimension d is so determined (steps 4a-d) as 
to exclude the terminal partition itself from the quantity z specified by step 5. 
Since only the first column of the grid matrix is incorporated in the partitioned 
representation, step 6 excises a prefix of dimension g rather than 2 g as in 
Program 3.4. 

Pools 

Components of the physical vector n in use for the representation of one 
quantity must not be allocated to the representation of some other quantity. 
The construction of a chained representation therefore poses one problem 
not encountered in its use , namely, the specification and observation of 
restrictions on the availability of components of n. The restrictions can 
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conveniently be specified as a pool , consisting of the available components 
of 7i. Each allocation made must then be reflected in a corresponding 
change in the pool. Moreover, as each piece of data is deleted, the com¬ 
ponents allocated to it are returned to the pool. 

If, as in Program 3.5, a pool is treated as a stack, then the component 
next taken from the pool is the component last added to it. The queue of 
components in the pool thus obeys a so-called last in first out , or LIFO 
discipline. The dimension in n of the last component of a pool will not, in 
general, agree with the dimension required for the next quantity it is called 
on to represent. If it exceeds the requirements, the extra segment may be 
left in the pool, and the pool therefore tends to accrue more and more 
components of smaller and smaller dimension. Hence it may be wise, or 
even essential, to revise the pool occasionally so as to coalesce the segments 
into the smallest possible number of infixes. This process can even be 
extended to allow substitutions in other vectors in order to return to the 
pool short segments which may unite existing segments of the pool. This, 
however, will require a systematic scan of the chained vectors. 

If the dimension of the last component (or perhaps of all components) 
of the pool falls short of the requirements for representing a new quantity, 
segments of the pool can be chained together. This requires the use of a 
special partition symbol or other indication to distinguish two types of 
links, one which marks the end of a given representation and one which 
does not. More generally, it may be convenient to use multilevel partition 
symbols to distinguish several levels of links, as was suggested for the 
representation of a matrix. 

Queue disciplines other than LIFO may be used. Three other types of 
primary interest in allocation queues are the FIFO (first in first out), the 
dimension-ordered , and the address-ordered disciplines. FIFO uses a 
forward chain and may be preferred over LIFO because it uses the entire 
original pool before using any returned (and usually shorter) segments. 

The components of a dimension-ordered pool are maintained in 
ascending (or descending) order on their dimensions in 7r. This arrange¬ 
ment is convenient in selecting a pool element according to the dimension 
required. The components of an address-ordered pool are arranged in 
ascending order on their leading addresses. This arrangement facilitates 
the fusion of components which together form an infix of 7i. 

If each of the available components of n is set to a special value which 
is used for no other purpose, then the available components can be 
determined by a scan of n. Such a pool has no structure imposed by 
chaining and will be called a marked pool. 

A marked pool requires little maintenance, since components returned 
to it are simply marked, but selection from it requires a scan of 7r and is 
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therefore relatively slow. The use of marked and chained pools may also 
be combined—all returned components go to a marked pool which is left 
undisturbed until the chained pool is exhausted, at which time the entire 
marked pool is organized into a chained pool. 

Summary 

Since any structured operand can first be reduced to an equivalent 
vector, the problems of representation can be discussed in terms of vectors 
alone. The characteristics of the linear, chained, and partitioned repre¬ 
sentations of a vector may be summarized as follows. A linear representa¬ 
tion permits the address of any component to be computed directly as a 
linear function of its indices and hence requires no scanning of the vector. 
However, the strict limitations which it imposes on allocation may en¬ 
gender : (1) conflicts with allocations for other operands, (2) waste of storage 
due to the imposition of a common dimension in n for all components, or 
(3) uneconomical execution due to the extensive reallocations occasioned 
by the insertion or deletion of other than terminal components. 

The concept of the grid matrix is helpful even when the corresponding 
allocation is implicit in the program. The explicit use of a grid matrix 
which is itself in a linear representation removes the restrictions on the 
allocation of the vector itself while retaining the advantage of direct 
address computation. The address computation differs from the linear 
case only in the addition of a single reference to the grid matrix and hence 
requires no scanning. The difficulties enumerated for the direct linear 
representation are not eliminated but merely shifted to the linearly 
represented grid matrix itself, where they may, however, prove much less 
serious. 

A chained representation allows virtually arbitrary allocation, relatively 
simple operations for the insertion and deletion of components, the direct 
representation of more complex structures such as trees, and economical 
joint representations of vectors which have one or more components in 
common. However, a chained representation requires extra storage for 
the grid matrix which it incorporates and occasions additional operations 
for scanning when the components are selected in other than serial order. 
The required scanning can be reduced by the retention of auxiliary 
information which allows the chained representation to be entered at 
several points. 

A partitioned representation requires the allocation of a single infix of 
7i, and selection requires a fine scan, i.e., a component-by-component scan 
of 7i to detect partition symbols. Partitioning removes the need to 
incorporate the grid matrix explicitly and does not impose a common 
dimension in n for all components. 
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Mixed systems employing combinations of linear, chained, and parti¬ 
tioned representations are frequently advantageous. Block chaining , for 
example, involves the chaining of blocks, each consisting of an infix of n 
and each serving as a linear representation of some infix of the represented 
vector. Alternatively, each chained block may be a partitioned represen¬ 
tation of some infix. 

3.3 REPRESENTATION OF MATRICES 

Structured operands other than vectors may be represented by first 
reducing them to equivalent vectors which can, by employing the tech¬ 
niques of the preceding section, be represented, in turn, in the physical 
vector 7i. In the case of a matrix A, two alternative reductions are of 
interest, the row list r = E/A = A 1 ® A 2 ® • • • © and the column 
list c = E//A. If r h . A/, and c k are corresponding elements of the three 
alternative representations, then in a 0-origin system: 

h = vi + y, 
k = i + nj- 

Consequently, 

i = [h -r v\ = (i | 0 k, 
and j = v | 0 h = [k /u \. 

The dependence of h on k can be obtained directly by substituting the 
foregoing expressions in the identity 

h = v X [h 4- v\ + v | 0 h 

to yield h = v x (ju | 0 k) -f [k 4- [i J. 

Similarly, k = [jl X (v | 0 h) + [h 4- 

The permutation h which carries the row list r into the column list c 
(that is, c = h$ 0 r) can be obtained directly from the foregoing expression 
for h as follows: 

h = v x (jue | 0 i°) + [i° -=- pe\. 

The expression for the fcth component of h is identical with the expression 
for h above. Hence, if c = hj 0 r, then c k = r hk = r h as required. 

If the row list (or column list) is itself represented linearly, then the 
address of any component A / is obtained as a linear function of the indices 
i and j. If either a file or a chained representation is to be used for the list 
vector, then the components are processed most efficiently in serial order, 
and the use of column list or row list is dictated by the particular processes 
to be effected. 
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If a large proportion of the elements of a matrix are null elements, it is 
called a sparse matrix. Sparse matrices occur frequently in numerical 
work (where zero serves as the null element), particularly in the treatment 
of partial difference equations. A sparse matrix A can be represented 
compactly by the row list r = U/A , and the logical matrix U , where 
U = (A 0). The matrix A may then be obtained by expansion. 
A = U\r. 

Alternatively, the column list c = (A ^ 0 )//A may be used. The 
transformation between the column list c and row list r must, in general, 
be performed as a sequential operation on the elements of U. Since it is 
frequently necessary to scan a given matrix in both row and column order 
(e.g., as either pre- or post-multiplier in a matrix multiplication), neither 
the row list nor the column list alone is satisfactory. A chaining system 
can, however, be devised to provide both row and column scanning. 

Let L be a matrix such that L x is a list of the nonzero elements of a 
matrix A in arbitrary order, L 2 * is the column index in A of element L x \ 
and Lf is the row index in L of the next nonzero element following Lp in 
its row of A. If Lp is the last nonzero element in its row, Lf = °. Let f i 
be the row index in L of the first nonzero element of row A\ and let 
fj = 0 if A j = 0. The following example shows corresponding values of 
A, L, and /: 


16 0 0 9\ 


5 3 »' 


l 3 \ 

0 3 0 0 


6 1 5 


'4 

0 0 0 0 

L = 

3 2 0 

/ = 

O 

7 8 0 4 


9 4 ° I 


[ 6 

\0 0 5 0/ 1 

7 11 


w 


\ 4 4 °/ 

The matrix L will be called a row-chained representation of A and may be 
used, together with the vector /, for the efficient scanning of any row A i as 
illustrated by Program 3.7. The vector L 3 can be modified so as to give the 
address in n directly rather than the row index in L of the next element in 
the row, and Program 3.7 can then be easily re-expressed in terms of the 
physical vector n. 

Program 3.7. Step 2 yields the index in L of the first element of the /th row of A. 
Step 4 determines its column index j, and step 6 determines the index of the 
succeeding component. The process terminates at step 3 when the scan of the 
row is completed. 

If L x is chosen as a row list, the vector L 3 reduces to the form Lf = k + 1 
or Lf = °. Its function can then be served instead by incrementation of 



§ 3.4 


Representation of trees 121 


1 -origin indexing 

fi 

Row index in L of first nonzero 
element of row A i .f i = ° if A 1 =0. 

k 

Row index in L of next element. 

Lx 

List of nonzero elements of A. 

V 

Column index in A of Lf. 

V 

Row index in L of next nonzero 
element following L-f in its row in 
A. L z k = ° if no such element 
exists. 



Legend 

Program 3.7 Determination of the row vector A i from a row-chained represen¬ 
tation of A 

the index k and by the use of the logical vector u = (L 3 = °c) for deter¬ 
mining the end of each row. 

The construction of a column-chained representation is analogous to 
that of a row-chained representation, and the two representations can be 
combined in a single matrix L which gives both row and column chaining 
employing but a single representation (that is, LJ of the nonzero elements 
of A. 


3.4 REPRESENTATION OF TREES* 

A tree T may be represented by a matrix and hence, in turn, by a vector 
in a number of useful ways as follows: 

1. by a full right list matrix ]T or by any column permutation thereof 
(Sec. 1.23), 

2. by a full left list matrix [T or by any column permutation thereof, 

3. by a right list matrix a 2 /]T, 

4. by a left list matrix a 2 /[T, 

5. by various chain list matrices. 

The full left and right lists seldom prove more convenient than the more 
concise left and right lists. Except for the special case of a homogeneous 

* Johnson (1962) provides a comprehensive treatment of the representations of trees 
and discusses the suitability of each representation for a variety of search procedures. 
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tree, both the right list and the left list are awkward to use for path tracing. 
This function is better served by the chain list matrix, to be defined as a 
formalization of the chaining scheme suggested in Sec. 3.2. 


Simplified list matrices 


In certain important special cases, the various list representations of 
trees may be simplified. If the degree of each node is a known function 6 
of the value of the node, then for any list matrix M, Mf = <5(M 2 *), and the 

degree vector M ± may be eliminated with¬ 
out loss. The node vector alone then repre¬ 
sents the tree and may be referred to as a 
right or left list vector as the case may be. 

For example, in the tree of Fig. 3.8 
(which represents the compound logical 
statement x A (y V z )), a fixed degree is 
associated with each of the logical operators 
and, or, and not (namely, 2, 2, and 1), and the 
degree zero is associated with each of the vari¬ 
ables. The statement can therefore be repre. 
sented unambiguously by the left list vector 



Figure 3.8 The compound 
logical statement x /\ (y \j z) 


V = (A, ,x, v,y,z). 


This is the so-called Lukasiewicz, Polish, or parenthesis-free form of the 
compound statement [Lukasiewicz (1951) and Burks et al. (1954)]. 
Frequently, the only significant nodes of a tree T are its leaves (e.g., in 
Example 3.2 and in a certain key transformation of Fig. 4.7) and all other 
nodes may be considered as nulls. Hence if M is any list matrix, the 
significant portions of M x and M 2 are (M 1 ^ 0)/M 1 and (M 1 = 0)/M 2 , 
respectively. These significant portions may then be coalesced to form the 
single vector 

» = /M l5 (M, = 0), m 2 /, 


which, together with the logical vector (M x = 0), forms a leaf list matrix 
that describes the tree. Moreover, if the values of the leaves are distin¬ 
guishable from the components of the degree vector, the logical vector 
(M 1 = 0) may also be dropped. 

The use of left lists 

The use of the right list matrix is illustrated by the repeated selection 
sort treated in Sec. 6.4. The use of left lists will be illustrated here by two 
examples, each of interest in its own right: the partitioning of the left list 
of an tt-tuply rooted tree to yield the left lists of the component singular 
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< 


Program 3.9 Partitioning of the left list of an /7-tuply rooted tree 

subtrees and the construction of a Huffman minimum-redundancy prefix 
code. 

Example 3.1. Partitioning of an w-tuply rooted tree. Program 3.9 shows a 
scheme for partitioning a left list Z of a tree T into component subtrees, i.e., for 
determining the vector p such that pj is the moment of the singular subtree T,. 
Thus v{p) = ^(T), pj = fi(Tj), and the infix ((p + ctf -1 ) j a p i)l/Z is the left list 
of Tj. 

The loop 6-10 scans successive components of the degree vector Z x (in ascend¬ 
ing order) and computes r, the indicated number of roots. The value of r in¬ 
creases by, at most, one per iteration, and when r becomes unity, the end of a 
singly rooted tree has been reached. Its moment m is then appended (step 11) as 
a new final component of the partition vector p, the parameters m and r are reset, 
and the scan of the next rooted tree is begun. Normal termination occurs at 
step 3; termination at step 6 indicates ill formation of Z. 

Example 3.2. Huffman minimum redundancy prefix code. If b is any set such 
that v(b) > 1, then any other finite set a can be encoded in h, that is, represented 
by b. (The sets a and b may be called the “alphabet” and “basic alphabet,” 
respectively.) If v(a) < v(b), the encoding may be described by a mapping vector 
k such that p(a,-) = b k . If v(a) > v(b ), then each must be represented by a 
vector x l c= b. For example, if a = i°(10) and b = i°(2), then the decimal digits a 
may be encoded in the so-called 8421 system: 


1 — > 

P *- «(0) 

2 

i <- 0 

3 


i ■■ (<Z) 

4 


m*- 0 

5 


r <-0 

6 


i : M z ) 

7 


i — i + 1 

8 


m <-m - 1-1 

9 


r <-r - hi - Zp 

10 


r : 1 

11 


p <-p © (m) 



1-origin indexing 

Z 

Given left list of T. 

i 

Row index of Z in ascending 
scan. 

r 

Indicated number of roots of 
current rooted subtree. 

m 

Moment of current rooted 
subtree. 

P 

Partition vector of Z, that is, 
pj = MT,). 


Legend 


(2 £ (4)) J_ x* = 
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( 0 ) 

(1 0 0 0 ) 

(10 0 1 ) 

(10 1 ) 

(1 10 ) 

( 111 ) 

Figure 3.10 Construction of a Huffman prefix code 

In so-called fixed length coding the vectors x i have a common dimension d , and 
the decoding of a message m (consisting of the catenation of vectors x l ) involves 
the selection of successive infixes of dimension d. If the probability distribution 
of the characters a x occurring in messages is not uniform, more compact encoding 
may be achieved by using variable length codes and assigning the shorter codes to 
the more frequent characters. Decoding of a message in variable length coding 
can be performed only if the boundaries between the successive x 1 are indicated in 
some way. 

The boundaries between characters in a message in variable length code may 
be demarked by special partition symbols (which is inefficient) or by using a 
prefix code in which no legitimate code point x i is the prefix of any other legitimate 
code point, including itself. The index vectors of the leaves of any tree possess 
this property; conversely, any set of prefix codes can be arrayed as the leaves of 
some tree. Hence if each character of the set to be encoded is assigned as the leaf 
of a common tree, and if each character is encoded by the associated index vector, 
a so-called prefix code is attained. Figure 3.10 furnishes an example of a binary 
code (i.e., the branching ratios do not exceed two) constructed in this manner. 
0-origin indexing is used. The discussion will be limited to binary trees. 

If fi is the frequency of the /th character and is the length of the assigned code 
(i.e., the length of path to the root), then the most efficient code is attained by 
minimizing the scalar product / + /. This may be achieved by the following con¬ 
struction, shown to be optimal by Huffman (1952). First, the characters to be 
encoded are ail considered as roots, and the two roots of lowest frequency are 
rooted to an auxiliary node (shown as a null element in Fig. 3.10), which is then 
assigned their combined frequency. The process is repeated until only two roots 
remain. The tree of Fig. 3.10 is optimal with respect to the frequencies shown to 
the left of the leaves. The appropriate combined frequencies are shown to the left 
of each of the nonleaves. 

Programs 3.11 and 3.12 show the construction of the tree T representing a 


0.38@ 
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T <-c 
fXi(T) : 2 
T-(0//)JT 

T - \\(o) © a 2 //T, a 1 , a 2 //T\\ 

f <~\ + /a 2 //, a 1 , a 2 //\ 

Program 3.11 Construction of the binary Huffman code T for characters c 
with frequency/ 

Huffman code for a set of characters c { with frequencies f t , the former in terms 
of the tree itself and the latter in terms of its left list. 

Program 3.11. The frequency vector/ is permuted (step 5) to bring it to ascend¬ 
ing order, and the tree is subjected (step 3) to the same permutation. Step 4 
replaces the first two rooted subtrees of T by the single subtree obtained by root¬ 
ing them in a null, and step 6 makes the corresponding alterations in the 
frequency vector. The tree is initialized (step 1) as a one-level tree whose roots 
are the given characters, and the process terminates when the number of roots of 
T has been reduced to two. 

Program 3.12. The tree T of Program 3.11 is represented by the left list node 
vector z , in conjunction with the implicit degree vector d = 2 x (z = o € ). The 
algorithm differs from Program 3.11 primarily in the reordering of the subtrees 
(steps 6-9). Step 7 appends to x the left list of the ith subtree (of the reordered 
tree) selected by the partition vector p according to the conventions of Program 
3.9. Step 10 prefixes x by the new null root, and steps 11-12 redefine p 
appropriately. 

Program 1.21 can be applied to the left list produced by Program 3.12 to deter¬ 
mine the associated index matrix (in a 0-origin system), and hence the actual 
codes assigned. 

It is not essential that the characters be assigned to leaves in precisely the order 
specified by Programs 3.11 and 3.12, and it is sufficient that the dimension of the 
leaf index increase monotonically with decreasing frequency of the character. It 
is therefore unnecessary to carry the characters themselves through the process; 
it suffices to determine the structure of the tree, sort the corresponding index 
matrix to right list order (which is ordered on dimension of the index vectors), 
and assign the characters (in decreasing order by frequency) to successive leaves. 
Since the structure of such a tree (whose nodes have a common irrelevant value 
and whose nonleaves all have a common branching ratio equal to the number of 
roots) is sufficiently determined by the moment vector p-(T), the process of Pro¬ 
gram 3.12 can be simplified. 
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2 


p *- e(v(z)) 
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1-origin indexing 

c 

Given character set. 

z 

Left list of Huffman 

tree. 

Si 

Frequency of zth sub¬ 
tree of z. 

p 

Partition vector p { is 
the moment of the zth 
subtree of z. 

X 

Reordered left list 
with subtrees in 
ascending order on 
frequency. 


Legend 

Program 3.12 Construction of the 
left list z of the binary Huffman 
code for characters c with fre¬ 
quency / 


Chain list matrices 

The full chain list matrix of a tree T is a matrix P of dimension ju( T) X 
(d(T) + 2) defined as follows: P 2 is some node vector of T, P x is the 
associated degree vector, Pj +2 is null if j exceeds the associated degree Pp 
and is otherwise the row index in P of theyth node emanating from node 
P 2 . Table 3.13 shows a full chain list matrix for the tree of Fig. 1.16. A 
full chain list matrix is called a full right {left) chain list matrix if the nodes 
occur in right (left) list order. 

The full chain list matrix is a formalization of the scheme suggested in 
the discussion of chained representations (Sec. 3.2). Its convenience in 
forward path tracing is obvious. Since it does not identify the roots of the 
tree, an auxiliary vector must be provided for this purpose. However, if 
the ordering chosen for the nodes is that of a right list, the roots occur 
first in the list, their number r = v(Px) — (+/Pi) is specified by the degree 
vector P l9 and the need for the auxiliary vector vanishes. Moreover, since 
a right list groups all nodes emanating from a given node, each row of 
a 2 /P is simply a sequence of integers followed by null elements, and the 
information necessary to path tracing is provided by the column P 3 alone. 

The right chain list matrix of a tree T is therefore defined as a 3 /P, where 
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A full chain list matrix The right chain Filial-heir chain list 

list matrix 

(a) (b) (c) 

Table 3.13 Chain lists of the tree of Fig. 1.16 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 


P is the full right chain list matrix of T. It is illustrated by Table 3.136 
Program 3.14 shows its use in path tracing. Although the degree vector P 1 
is redundant (that is, P ± and P 3 can be determined one from the other), it 
provides a direct check (step 6) on the legitimacy of the index vector r 
which would be difficult to obtain from P 3 alone. 

For a search of the type described by Program 3.14, it is necessary to 
scan down a level until agreement is reached and then across to the next 
level. For this type of scan, the filial-heir chain list is compact and con¬ 
venient. 
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1-origin indexing 
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Given index vector. 
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d^-viPJ - +IP 1 
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Right chain list matrix of T. 
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j ■ v (r) 


Pi 

Degree vector of T. 

5 
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Node vector of T. 

6 
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p 3 
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Chaining vector of T. 

Path vector T r . 
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d 

Degree of current node. 
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Pi 


k 

Base address of the infix con¬ 
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d <- Pp 



taining the current node. 

10 


k 


i 

Index of succeeding node in the 
path T r . 



j 

Current index of index vector 




r. 


Legend 

Program 3.14 Determination of the path p = T r from the right chain list 
matrix P 


The set of (j + l)th level nodes of the subtree are collectively called 
the jth filial vector of node i, and the first member of the first filial vector of 
node i is called the heir of node i. (For brevity, the first filial vector of a 
node will also be called its filial vector.) If each node is chained only to its 
successor in the filial vector containing it and to its heir, the resulting 
representation is called a filial-heir chain list. Formally, the filial-heir 
representation of a tree T is a matrix F of dimension /u( T) x 4 such that 
F 2 is a node vector of T, F ± is the associated degree vector, F s is a filial 
chain such that Ff = j if node Ff is the successor of node Ff in the 
smallest filial set containing it and Ff = ° if node F 2 l has no such successor, 
and F 4 is an heir chain such that Ff = h if node F 2 h is the heir of node Ff 
and Ff = o if F 2 l is a leaf. The filial-heir chain list is illustrated in Table 
3.13c. 
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EXERCISES 

The symbols a and c will be used exclusively to denote lower case and capital 
alphabets defined as follows: 

a = (o, a, b, c, . . . , z,. ,, , #, *, + ). 
c = ( 0 , A, B, C, ..., Z, + ). 

The expression nc * will be used to specify the set x as the range of the com¬ 
ponents of 71. 

3.1 For each of the following cases, specify a suitable encoding matrix and 
format vector and show the explicit value of the infix of 7t which (in a solid 
representation) represents the given example vector x: 

(a) the decimal digits d = i°(10) in a ranked fixed-length code for tt c= i°( 2). 
Example: x = (6, 8, 9). 

(b) the set a in a ranked fixed-length code for n c: i°(2). 

Example: x = (c, a, t). 

(c) the set a u c ^ i°(10) in a fixed-length code for n c: i°(10). 

Example: x = (M, a, y, o, 3,,, 1, 9, 6, 0,.). 

(d) the set a u c in a two-case code (with single-character shift) for n c: a. 
(See Brooks and Iverson, 1962.) 

Example: x = (T, r, o, y,,, N, ., Y,.). 

(e) the set a in a Huffman prefix code for 7T c. i°( 2). Assume the frequency 
distribution given in Dewey (1923). 

Example: x = (t, r, e, e). 

3.2 For each of the cases of Exercise 3.1 write a program which decodes the 
infix (/1 0 L j )/n , that is, which produces the vector z represented by the infix. The 
auxiliary physical vector n 1 c s may be employed to represent the first column of 
the encoding matrix, where s is the set encoded. Perform a partial trace of each 
program for the example value used in Exercise 3.1. 
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3.3 The ordered set of months m = (JANUARY, FEBRUARY, 
DECEMBER) is to be represented by the physical vector tc c c u i°( 10). For 
each of the following types of representation, specify a particular representation 
and show the values of the relevant components of re: 

(a) a linear representation (employing null elements for filling to a common 
dimension in k). 

(b) a solid representation for each element of m and an appropriate grid 
matrix itself represented linearly. 

(c) a chained representation. 

(d) a double chained representation. 

3.4 (a) For each of the cases of Exercise 3.3, write a program which selects 

month m k . 

(b) Trace each program for the case k = 2. 

(c) For case (d) of Exercise 3.3, write a program which selects m k by 
forward chaining if k < v(m) 2, and by backward chaining if 
k > v(m) -r 2. 

3.5 For each of the cases of Exercise 3.3, write a program which “prints out” 
the set of months in a minimum number of ^-character lines, inserting a single 
null between successive months except where (i) further nulls must be added to 
prevent the continuation of a single word from one line to the next, or (ii) no null 
is needed between two successive words, the first of which is coterminous with 
the line. In other words, produce a matrix Z of row dimension n and of minimum 
column dimension such that (Z ^ °E)/Z = (p(mj © p(m 2 ) © • • • © p(m 12 ), 
and such that each row Z* may be partitioned into one or more vectors of the 
formp (m k ) © o€, all but the last of which must be of dimension v[p(m k )] + 1. 

3.6 Assuming a linear representation for each of the logical vectors involved, 
and a forward-chained representation for each of the remaining operands, write 
programs for the following operations. Assume in each case that the arguments x 
and y need not be retained, and assume the use of a backward-chained pool 
where necessary. 

(a) z *-\x,u,y\ 

(b) z lx, u,yI 

(c) z k | x 

(d) z <- k \x 

3.7 Repeat Exercise 3.6(a), using separate grid matrices for x, y, and z instead 
of chained representations. Specify a suitable linear representation for each of 
the grid matrices. 

3.8 (a) If a chained representation is used for a vector x, then the selection of a 

specified component can be made faster by providing a number of 
alternative starting points for the required scan. State precisely the 
quantities required in such a process and write a program showing its 
use. 

(b) If provision is made for starting the scan at any component of x, the 
chained representation may itself be simplified. Show precisely what 
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the simplified form is and identify the type of representation to which it 
is equivalent. 

3.9 Frequently a vector x kept in a partitioned representation (for efficient use 
of storage) must be “unpacked” to a linear or other more accessible form for 
efficient processing. The converse operation of “packing” is also required. Let 
the partitioned representation be a file ® employing an intercomponent partition 

and a terminal partition X 2 , and write both packing and unpacking programs 
for each of the following cases. Assume that the maximum dimension in n of any 
component is n. 

(a) A solid linear representation employing null fill. 

(b) An allocation prescribed by a grid matrix G with G 2 = ne. 

3.10 Let n c; i°(2), let the set a be encoded in a five-bit code such that (2c) J_ 
p(a,) = /, and let each component of the vector x be an (uncapitalized) English 
word. Using 0-origin indexing throughout, specify a suitable partitioned repre¬ 
sentation in 7t for the vector x, and repeat Exercises 3.9(a) and 3.9(b), using it in 
lieu of the files. 

3.11 For each of the following pool organizations, write a program to convert 
a given marked pool into a backward-chained pool: 

(a) dimension-ordered. 

(b) address-ordered. 

3.12 For each of the following queue disciplines, write programs which take 
from and return to the pool an infix of length n. Use secondary linking and 
•relegate to a marked pool any infix which is too short for linking. In each case 
choose the type of chaining best suited to the particular queue discipline. 

(a) LIFO (last-in-first-out). 

(b) FIFO (first-in-first-out). 

(c) Dimension ordered. 

(d) Address-ordered (utilize the possibility of fusing adjacent infixes). 

3.13 Give a complete specification of a scheme for representing a tree T by a 
full chain list matrix which is not in right list order. Write a program (expressed 
in terms of the physical vector n) which determines the path vector T 1 for a given 
index vector i. 

3.14 Give a complete specification of a scheme allowing joint representation of 
those components shared by two or more of a family of vectors x 1 , x 2 , . . . , x n as 
suggested in Sec. 3.2. Write programs to (i) select component */, and (ii) delete 
component x/. 

3.15 Let n cz a u i°( 10), and let x 1 , x 2 , . . . , x n be a family of vectors whose 
components belong to the set a 1 /[a u i°( 10)]. Let the average and the maximum 
dimensions of the vectors x i be a and m, respectively. Assume that the chaining 
index is represented in decimal, with each digit represented by one component 
of 7T. Determine (as a function of m and ri) the value of a below which a chained 
representation provides more compact storage than a linear representation with 
null fill. 
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3.16 Write a program which uses the minimization operation u +- v [ x to 

determine the ordering permutation vector p b). 

3.17 Let U = (X 0) and r = U\X jointly represent the sparse matrix X. 

(a) Write a program which determines (as a function of U and r) a suitable 
row-chained and column-chained representation of X. 

(b) Write a program defined on the representation produced in part (a) to 
compute the product Y = X + X, itself represented in the form V = 
(Y * 0) and p = V/Y. 

(c) Write a program to determine the trace (that is, +11IX) of X from the 
representation produced in part (a). 

3.18 The unique assignment of Huffman codes produced by Program 3.12 is, 
in general, only one of many equally efficient assignments, since the symbols to be 
coded need only be assigned, in decreasing order on frequency, to the leaves of 
the code tree in increasing order on their levels. Show that the structure of the 
tree produced can be sufficiently described by its moment vector alone, and write 
a program for the construction of a Huffman code based on this fact. 

3.19 Following the notation and terminology used in Program 3.9 for the 
analogous case of a left list write a program which determines from the right 
list R of a tree T, the partition vector p which partitions it by levels. 

3.20 Write a program which determines the right list R = a 2 /]T as a function 
of the left list L = a 2 /[T. Incorporate tests of well formation. 

3.21 Let [X°iy denote the /?th power of the square matrix X with respect to the 
operators O x a 2 nd 0 2 » that is, [Xgi] p = Xgi Xgi • • • giX to p factors. 

(a) Show that ([C v y)i — \ if and only if there is a path of length p from 
node / to node j in the graph («, C). 

(b) Show that [C v y = o for some p < v(C) if and only if (n, C) contains no 
circuits. 

(c) If (n, C) contains no circuits, the connection matrix C is said to be “con¬ 
sistent.” The result of part (a) can be used to check consistency. Program 
the alternative method of Marimont (1959). 

(d) If H = C v /, then ([H v y)f = \ if and only if / = j or there exists a path 
from node i to node j of length n < p 4- 1. Show that for any connection 
matrix C, [H v y converges to a limit. 

3.22 Devise programs to determine 

(a) whether a given connection matrix C represents a tree. 

(b) the left list of the tree (n, C). 

(c) the right list of the tree (n, C). 

(d) a node list n and connection matrix C as a function of 

(i) a left list L 

(ii) a right list R. 

3.23 Show that (n, C) and (n p , C p P) represent the same graph for any permuta¬ 
tion p. 

3.24 If (n, C) is a tree and if K = C v c, then C can be determined as a function 
of K (see Ross and Harary, 1960). Write a program for determining C from K. 
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SEARCH TECHNIQUES 


In classical applied mathematics most functions of interest can be 
approximated by some algorithm which becomes, for practical purposes, 
the definition of the function. In other areas, however, many functions of 
practical, if not general, interest (such as the correspondence between 
employee name and salary) can be specified only by an exhaustive listing of 
each argument value and its corresponding function value. Such a function 
will be called fortuitous. 

The basic algorithm applicable to the evaluation of a fortuitous function 
is a search of the list of arguments, i.e., a comparison of the given argu¬ 
ment with the list of arguments to determine the correspondent to be 
selected. In such an algorithm it is convenient (as illustrated by Program 
1.12 a) to distinguish three phases which successively determine the 
following quantities: 

(1) the index or rank r = k i k of the argument k in k. 

(2) the index i = p r of the correspondent in s. 

(3) the correspondent a = s h 

Step (2) is a permutation defined by the permutation vector p. Steps (2) 
and (3) are simple selections from structured operands (normally in linear 
representations) and require no further discussion. Step 1 is called 
ranking , and the methods for accomplishing it merit detailed treatment. 

The argument k of a mapping (and hence of a ranking) operation will be 
called a key. The German-English dictionary mapping of Example 1.2 is 
typical of mappings from key to correspondent. Ranking is itself a special 
mapping from the key set k onto its own index set i\v(k)). 

If the representation used for some or all of the data imposes certain 
restrictions (such as serial access), there may be some advantage in 
coalescing the three phases of the mapping operation so as not to determine 
the rank explicitly. It will, however, be convenient to limit the discussion 
almost exclusively to the problem of ranking. 

There are two main types of ranking processes: scanning and key 
transformations. A scanning process compares the key k successively with 
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selected elements of the key set k to determine the rank r = k ik. A key 
transformation is any function or algorithm t{k) which maps the set k into 
some subset of the integers. The set of derived keys is defined as the set 
d containing all derived keys arranged in ascending order. The set of all 
keys which map into d j is called the jth equivalence class defined by the 
transformation t. 

If v (d) = v(k), the key transformation is biunique and the ranking 
operation may therefore be completed by a permutation p such that 
pj — i for j = t(k z ). If v(d) < v(k), then at least two distinct elements of k 
map into the same element of d and the ranking process must be completed 
by a scan of one of the equivalence classes defined by t. 

If, for example, k = (m, tu, w, th, f) is the ordered set of working days 
encoded according to the encoding matrix 

1 Im 0 0 1 \ 

2 tu 0 1 0 

3 C = w 0 1 1 

4 th 1 0 0 

5 'fill / 

and format vector / = (0, 1, 1, 1), then scanning can be accomplished by 
comparing p (k), the encoded representation of the key k , with successive 
rows of f/C to determine the rank r of the row on which agreement occurs. 
Moreover, the key transformation 

t(k) = (2e) 1 p (k) 

is unique, but requires an associated mapping vector m = (1, 2, 3, 4, °, 
°, 5). Had p(f) (that is,//C 5 ) been chosen as (1,0, 1), the mapping vector 
would not have been required. Finally, the key transformation 

t\k) = 1 + p 3 (k) 

has the range d = (1, 2), is not unique, and requires a subsequent scan of 
one or other of the equivalence classes e 1 = (tu, th), and e 2 = (m, w, f), 
represented by (010, 100) and (001, Oil, 111), respectively. 

Although a strict ranking operation maps element k, into the integer /, 
any biunique mapping onto the index set 0(v(k)) will frequently serve as 
well. For, if p is the permutation required to complete the ranking process, 
and if a subsequent permutation j is required (as in step 4 of Program 
1.12tf), the two permutations can be combined in the single permutation 
q = jjjp. Similarly, the ranking of a set k may be considered as equivalent 
to the ranking of any set obtained by permuting k. 
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4.1 SCANNING METHODS FOR RANKING 

The two main methods of scan are called directed and controlled. A 
directed scan is begun in one of two possible directions from a given initial 
point /, either ascending (that is, /, i + 1, / + 2, . . .) or descending. The 
direction chosen may be determined by a comparison between the given 
argument and the set element k { at the initial point i. A controlled scan is 
executed in a sequence which is determined by successive comparisons 
between the argument and each element scanned. In an effective controlled 
scan, each comparison must determine the choice of the next element for 
comparison so as to (approximately) minimize the expected number of 
elements scanned. The directed scan is clearly well suited to the use of a 
file or chained representation, which imposes serial access to the elements, 
whereas the controlled scan is not. 

The scan length (i.e., the number of key elements scanned) will be used as 
a measure in analyzing and evaluating scanning methods. The same 
measure and the same analysis apply also to the converse situation, where 
the rank of an element is given and the element itself must be obtained 
from a chained or other representation which permits only serial access. 
An alternative related measure is the normalized scan length or scan 
fraction , defined as the scan length divided by the number of elements in 
the set of keys. 

A scan is said to be rooted if each execution begins at the same point r. 
A rooted scan may be advantageous when the frequency distribution of the 
arguments is nonuniform and the most frequent keys can be grouped near 
the root. A scan is called catenated if each execution is begun at the end 
point of the preceding scan. A catenated scan may be employed in using 
a file when the intervals between successive scans are so short as to allow 
little or no time for return to a fixed root or when the arguments are 
arranged in the same relative order as the items in the file. 

Directed scan 

A directed scan is called cyclic if element k x follows k v in an ascending 
scan and if k v follows k 1 in a descending scan. It is called noncyclic if the 
direction of scan is reversed whenever either of the terminal elements k x or 
k v is encountered. A cyclic scan is appropriate to a chained representation 
with end-around chaining; a noncyclic scan is appropriate to a file or to a 
chained representation without end-around chaining. 

The initial direction of scan may be chosen in several ways, the more 
important of which are enumerated and discussed below. For independ¬ 
ently and uniformly distributed arguments, the expected scan fractions 
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are summarized in Table 4.1. Certain of the results are also plotted in 
Fig. 4.2. 

Initial direction fixed (fixed scan). The ascending direction will be assumed. 
If the scan is rooted and cyclic, the root may, without loss of generality, be 
assumed to be one. The expected scan length for a set is then given by 

v(k) 

e= zm) x / = /i i 1 

i = 1 

where f(k .) is the normalized expected frequency of occurrence of the 
argument k t and / is the corresponding frequency vector defined by 
ft = /(*<)• 

The most efficient fixed rooted scan is therefore obtained by using the 
permuted set a = (0/(—f))$k such that the components of a are arranged 



Figure 4.2 Plot of cumulative probabilities of Table 4.1 (Numbers refer to the 
entries in Table 4.1) 
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in decreasing order on frequency. If the distribution of arguments is 
uniform (that is,/, = 1 M/))> then the expected scan length is ( v{k ) + 1) H- 
2, and the expected scan fraction is approximately one half. 

If a fixed scan is cyclic and catenated, the expected scan length depends 
on the distribution /, but if the arguments are independently distributed, 
then the expected scan length is independent of the ordering of the scanned 
set k. This may be shown as follows. The expected length l r of a scan 
rooted at r is given by M r + /, where M is the square matrix such that 
M r = (r — 1) 1 1 1 . The probability of beginning a catenated scan at r is 
the probability of ending the previous scan at r, that is, f r . Consequently, 

e=/J/=/+M+/. 

Since, in general, f+M+f = f+M+f, then 2e = (/ + N + /), 
where N = M + M. If, for example, v(k) = 4, then 
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It is easily shown that N is of the form 

N = 2E + v{k)\ 

and consequently N% = N for any permutation p. But / + N + / = 
(/p) J (N£) + (/ p ) in general, and since N% = N, then f + N+f = 
( f p ) J N J (/ p ). Hence the expected scan e = (J)x/JN+ / remains 
the same for any permutation of/ or, equivalently, for any permutation of 
the key set k. 

If the arguments are not independently distributed, the analysis is, in 
the general case, very complex. However, a simple but effective use of 
correlation is made in the method of batching. If a is a collection or batch 
of uncorrelated arguments, each of which is to be ranked in the set k , then 
the total expected scan time for a fixed catenated scan will be v{a) times the 
expected scan time for a single item. If, however, the set a is ordered on k 
(that is, a = k n a), then the entire set a may be ranked in a fixed cate¬ 
nated scan whose normalized length does not exceed one.* If a given 

* The length of the scan will be determined by the maximum rank (in k) occurring in 
the set a. The expected value of the scan fraction is approximately equal to the expected 
value of the maximum occurring in a sample of size n = v(a) chosen from the continuous 
interval from zero to one. This value is known for various distributions [e.g., Cramer 
(1951) p. 370]; for a uniform distribution it is n -r (n + 1). 
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argument set z is not ordered on k, it may first be permuted by some sorting 
process to yield the set a = pjz which is ordered on k. The set a may then 
be ranked in k and, if required, the set of ranks may then be subjected to 
the inverse permutation q = p i 1 to yield the ranks in the original set z. 
The decrease in the total expected scan length may far outweigh the effect 
of the additional permutations required. 

If a fixed scan is noncyclic and rooted at one , the expected scan length is 
the same as for the fixed cyclic rooted scan. The fixed noncyclic scan is 
generally unsuited to any initial points r other than one (e.g., to a catenated 
scan), since the first (r — 1) elements of k are then reached only after a 
reversal of direction and a rescan of the set d r [k. 

For the case of a uniform distribution, the behavior of the fixed scan is 
summarized in entries 1, 3, and 4 of Table 4.1. The derivation will be 
illustrated in discussing entry 7. 

Initial direction giving shortest scan. For a fixed root r, the minimum 
expected scan is achieved if the items are disposed on either side of the root 
so that the frequency is a monotone decreasing function of |/ — r |, the 
scan length in a direct scan to the argument. In a cyclic scan, the position 
of the root is immaterial; in a noncyclic scan it is best centered at the floor 
(or ceiling) of (v(k) + 1) 2. In a 0-origin system this expression 

becomes v(k) -i- 2. 

For an arbitrary frequency function, the expected scan length is given by 
the scalar product/ J /, where / = li 1 — re |. For a uniform distribution, 
the results are given in entries 2 and 5 of Table 4.1. For a catenated scan, 
the corresponding results appear in entries 2 and 6. 

The possibility of choosing the initial direction so as to give the shortest 
scan to the argument depends on the information available. If the elements 
of k are strictly ranked on some function g(k t ), then the shortest direction 
from root r to argument x can, in the noncyclic case, be determined by a 
comparison of g(x) and g{k r ). For the cyclic case this does not suffice, and 
it is necessary* to know the index in k of the argument x. This case is 
therefore of interest primarily in selecting a specified element from a 
serial-access representation and is of little interest in an actual ranking 
operation. However, any double-chained representation or reversible file 
can be used in a noncyclic as well as a cyclic manner and hence admits of a 
choice of direction which is best in the noncyclic sense. 

Initial direction to nearer (farther) terminal. If the value of the root r is 
known for each individual scan in a catenated scan, the direction to the 
nearer terminal can be determined by comparing r with the midpoint 

* Approximating functions may, however, be used for estimating the index and the 
probable best direction. 
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( v(k ) + 1) ~ 2. A noncyclic scan starting toward the nearer end is clearly 
less efficient than one starting toward the argument, but it may be useful 
when the most direct route to the argument cannot be determined. The 
expected scan fraction is shown in entry 7 of Table 4.1; its analysis will 
illustrate the method used in constructing the entire table. 

It is assumed that the number of elements v(k) is sufficiently large that 
the scan fraction / may be considered as a continuous variable. Let 
p r (/ < b) be the probability that scan fraction/ does not exceed b , and let 
the function be represented in three parts such that pr (/ < b) = pr, (/ <b) 
in the ith half-unit interval in b. Let x be the normalized initial position of 
a given scan. Then 0 < x < 1, and, since the scan always begins toward 
the nearer terminal, the fraction of the set covered in a scan of length b is 
the same for the starting point (1 — x) as for x. Using this symmetry, 
attention can be restricted to values of x in the range 0 to For the function 
p r i (/ < b ), the value of b is also restricted to the range 0 to f. 

Consider fixed values of b and x with b < f. If 0 < x < b/2 , the fraction 
of the file covered by a scan of length b is given by b — x, for the scan 
begins at x, proceeds a distance x to the nearer terminal, and returns to 
the point b — x. If b/2 < x < b , the fraction covered is clearly x , for the 
scan will reach the nearer terminal but will not return past x. If b < x < f, 
the scan does not reach the nearer terminal, and the fraction scanned is 
therefore b. Since x is uniformly distributed, the function pr x (/ < b) is 
obtained by integration as follows: 

" [b/2 [b [A 1 _ Jy2 

pr 1 (f < b) = 2 (b — x) dx + x dx + b dx — -b b. 

LJo 2 Jb J 2 


The factor of two arises from the symmetry in x and the restriction of x 
to the interval 0 < x < Similarly, 

r [b /2 [A -i b 2 { 

pr 2 (J < b) = 2 (b — x) dx + x dx =-1— , 

LJo Jh /2 J 2 4 

and 

~ Cb-l [A 1 c 

pr 3 (/ < b) = 2 dx + (b — x) dx = — b 2 + 36-. 

LJo J&-i J 4 


Entry 8 shows the behavior of the scan starting toward the farther 
terminal. Although the distribution differs markedly from that obtained 
for starting toward the nearer terminal, it has the same expected value off. 
As may be expected, the function obtained for a fixed scan (entry 4) is 
the average of the functions obtained for cases 7 and 8 and is linear in b. 
Case 7 (toward nearer terminal) yields the smallest maximum scan length 
of the three. 
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Initial direction reversed {continued) from previous scan. In the absence of 
any other basis of choice, the initial direction can be chosen as a reversal 
or continuation of the direction which terminated the preceding scan. The 
behavior is shown in entries 9 and 10 of Table 4.1. The relations between 
the functions for fixed scan ( F ), continuation (C), and reversal {R) are 

F - R = —{F - C) < 0 for b < 1, 
and F - R = —{F — C) > 0 for b >1. 

Controlled scan 

The sequence followed in a controlled scan is commonly determined by 
a comparison which determines the relative ranking of any pair of elements 
x and y in the set k. It will therefore be assumed that comparison of the 
argument x — k h with the element kj determines whether h < y, h = y, or 
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i <- 1 
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Program 4.3 Ranking of x in k by binary search 

h > j. The subsequent scan may then be limited to one or other of the 
two subsets a 5-1 /^ and a j /k. The maximum dimension of the subset remain¬ 
ing to be scanned is therefore minimized by choosingy = {(v(k) + 1) 2j. 

If each subsequent element for comparison is chosen so as to (approxi¬ 
mately) halve the dimension of the set remaining to be scanned, the 
process is called binary search. Program 4.3 shows the details of binary 
search; i and k are the indices of the terminal elements of the remaining 
subset, and y is the index of the element selected for comparison. 

If v{k) = 2 fc , then any one of 2 j ~ x different arguments may be isolated on 
the yth comparison, for j e i 1 ^), and the one remaining argument will be 
located on the (k + l)th comparison. Hence for a uniform distribution of 
arguments, the expected number of comparisons required in a binary 
search is given by 

e b (2 k ) = (1 • 2° + 2 • 2 1 + 3 • 2 2 + • • • + k • 2*” 1 + (k + 1)) -h 2 k . 
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It can be shown (e.g., by induction on k) that 

e b (2* ) = [(k - l)2 k + k + 2]-r2 k . 

The expected number of comparisons is therefore approximately (k — 1), 
and for a general valueof v{k ), the number is approximately flog 2 v(k)] — 1. 
The expected value thus differs but slightly from the maximum value 

ri og2 (v(k) + i)i. 

If e s (v(k)) is the expected number of comparisons required in a fixed 
scan of k and if r is the ratio of the execution time for one step of binary 
search to the execution time for one step of fixed scan, binary search is 
(for a uniform distribution) the more or the less efficient according as e s 
exceeds or is exceeded by re b . Although the simplicity of Program 4.3 
suggests that the ratio r is small, it will be large if the representation of the 
elements of k permits serial access only. 

The methods may be combined by using k steps of binary search to 
select one of 2 k subsets, which is then subjected to a fixed scan. If the 
remaining subset contains m elements, the (approximate) reduction in the 
expected number of comparisons achieved by one further step of binary 
search is e s (m) — e s (m/ 2), and binary search should therefore be discon¬ 
tinued when e s (m) — e s (m/2) < r. For a uniform distribution, this result 
yields the following approximate expression for the optimum number of 
steps of binary search: 

J - K (it) • 

The ranking type of comparison required in determining the sequence 
in a controlled scan is always attainable for any arbitrary set k or for 
some permutation thereof. For, if p(fc 7 ) is the representation of k { in 7i, 
if 7i c i 9(b), if t t = (be) _j_ p(£*)> and if a = (0/t)$k, then the relative 
ranking of any pair of elements of a can be determined by comparing the 
base b values of their representations. If, for example, b = 10, and the 
four successive elements of k are represented by (1, 0, 9), (0, 6, 4), (7, 1, 3) 
and (5,0,6), then a is represented by (0,6,4), (1,0,9), (5,0,6), and 
(7, 1, 3), and relative ranking in a is determined by comparing elements as 
decimal numbers. 

In the execution of the binary search, the calculation of the index j (next 
element for comparison), and the explicit determination of the terminal 
indices i and k can be avoided by associating with each element k j a pair of 
indices which indicate the two possible succeeding choices for j. More 
precisely, if M is a matrix of dimension v(k ) x 3, whose first column is the 
set k , and whose second and third columns are vectors of indices (or nulls) 
from the set i 1 (r(^)) u (°), then Program 4.4 describes a directed scan of k. 
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-> j *- [(/i(M) + 1) 

—> x : 

- 

- j *- M 3 1 2 3 4 5 6 7 8 9 

Program 4.4 Generalized binary search 

The elements of M 2 and M 3 can be so chosen as to execute a binary search 
equivalent to that of Program 4.3. This is true, for example, for the 
matrix M of Fig. 4.5. 

The ordering of the elements of is clearly immaterial, i.e., if M x were 
permuted, then columns M 2 and M 3 could be respecified so as to yield the 
original scanning order. One consequence of this is the fact that the rows 
can be reordered so that the scan conveniently begins with the first row 
rather than with row [(}u(M ) + 1) -r- 2j. A more important consequence 
is the possibility of applying the method to the problem of multiple keys, 
which will be raised in the treatment of key transformations. 

As illustrated by Fig. 4.5, the matrix M specifies a tree whose nodes are 
the elements of M 1? whose branching ratios are two, and whose paths are 
traced by Program 4.4. The columns M 2 and M 3 can clearly be chosen to 
specify a scan sequence other than that of binary search. In particular, the 
element selected for comparison may be chosen so as to equalize (as 
far as possible) the total probability of the arguments in the two resulting 


1 bee ° ° 

2 cab 1 3 

3 cat o 4 

4 dog ° ° 

5 M = egg 2 7 

6 foe o o 

7 had 6 8 

8 hoe o 9 

9 nod ° © 

Figure 4.5 Tree traced by Program 4.4 
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subsets rather than to equalize the number of elements. This procedure 
yields the most efficient controlled scan. If the probability distribution is 
uniform, the method reduces to binary search. If the arguments are 
drawn from two or more sets having distinct probability distributions, the 
matrix M may be enlarged to include two index columns for each distinct 
set. Each such pair of columns may then be designed to provide an 
optimum scan for the associated distribution. 

4.2 KEY TRANSFORMATIONS 

Since a key transformation maps the set of keys k into a set of integers 
(the set of derived keys d ), any unique key transformation produces a 
derived key which can be used to select the component of a mapping vector 
directly and thus complete a ranking operation without the use of a 
scan. If the transformation is not unique, it may still be used to partition 
the original set k into v(d) subsets for scanning and so reduce the expected 
scan time. Ideally a key transformation should be both simple and unique 
and should produce a derived set d having a narrow spread; in practice, 
compromises must be made. 

Let k be the domain and d the range (in ascending order) of a key 
transformation t(k t ) and let e j be the equivalence class in k which maps 
into d j9 that is, t(x) = dj for all x e e j . The coalescence of t in k is then 
defined as the vector c such that c 5 = v(e j ), for j e i 1 (r(d)). Since the 
equivalence classes are disjoint and collectively exhaust k , then +/c = 
v{k). The spread of t in k is defined as 1 -f d v — d v Thus if k is the set 
(Sunday, Monday, . . . , Saturday), and if t maps each day into the rank 
(in the alphabet) of its leading letter, then d = (6, 13, 19, 20, 23), the 
spread s = 18, c = (1, 1, 2, 2, 1), and +/c = v(k) = 7. 

The key transformation is biunique if and only if c = e. Moreover, if 
the transformation t is biunique, the ranking operation (i.e., the determina¬ 
tion of the index of the argument in k) can be completed by a mapping 
vector whose components are selected by the index j = /(fc;) — d x + 1, 
and whose dimension is equal to the spread of t in k. The key transforma¬ 
tion of the preceding example is biunique when restricted to the set x = 
(Sunday, Monday, Tuesday), the set of derived keys is (13, 19, 20), and 
the mapping vector m = (2, o, o ? o ? o, o, 1, 3) of dimension eight serves 
to complete the mapping if its components are selected by the index 
j = /(*,) - 12. 

A key transformation is called j-origin if d x = j. Since the origin can be 
changed by subtraction of a constant, attention will be restricted to 
1-origin transformations. The spread of a 1-origin transformation is 
clearly d v . 
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A biunique key transformation is always 
attainable since, as remarked in the treat¬ 
ment of directed scan, the base b value of 
the representation of the elements of the 
domain k can be used. The spread of such 
a transformation may, however, be im¬ 
practicably large. If, for example, x were 
some small subset of the set of all ten- 
letter sequences, (e.g., all meaningful ten- 
letter words), then 5 1 would be 26 10 , and 
the required dimension of the associated 
mapping vector would be impracticably 
large. 

In general, if each element of k is of 
dimension h in n and if the (used) range 
of each element of n is the set i°(Z>), then the 
mapping vector required is of dimension 
b h . The use of the base b value of the rep¬ 
resentation in selecting the component of 
the mapping vector is equivalent to select¬ 
ing a path through a uniform 6-way tree as 
illustrated (using 0-origin indexing) in Fig. 

4.6, for b = 3 and h = 3. The branch to 
the yth level is selected according to they'th 
component of the representation. 

Sequential level-by-level selection in the 
tree is less convenient than the direct use 
of the base b value, except that the former 
frequently allows the full tree to be greatly 
contracted. If, for example, the tree of Fig. 

4.6 is used for the set k = (200, 010, 120, 

001, 022, 202) (as indicated by the numeric 
leaves whose values are the 0-origin ranks 
in k), then the full tree can be contracted 
to the nonhomogeneous tree of Fig. 4.7. 

The contraction is defined formally as 
follows: if the subtree rooted in a given 
node contains no significant leaves, the 

subtree is replaced by a single null leaf; if the subtree contains exactly 
one significant leaf, the subtree is replaced by that leaf. The contracted 
tree can then be represented by a chain list matrix or, since all nodes save 
the leaves are null, by a leaf list matrix M. For the example of Fig. 4.6, 



Figure 4.6 Uniform tree and 
mapping vector for the set k = 
( 200 , 010 , 120 , 001 , 022 , 202 ) 
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Figure 4.7 Contracted tree and associated leaf list matrix M 
for the set k = (200, 010, 120, 001, 022, 202) 


M is given in Fig. 4.7. The sequential biunique key transformation on 
the leaf list matrix of the contracted tree is described by Program 4.8. 

Program 4.8. The components of the argument x are scanned by the index j, 
and step 5 determines / as the index of the current node in the path. If node / is 
not a leaf, then step 6 determines k as the index of the first node reachable from 
node i. If node / is a leaf, then k is specified as the value of the leaf, and the right¬ 
pointing exit is followed on step 8 unless the exit at step 7 is taken first. This exit 
occurs only if x is an illegitimate argument which leads to one of the null leaves 
(such as the last leaf of Fig. 4.7) remaining in the contracted tree. The contraction 
was performed in the specified manner so as to allow the incorporation of such a 
test. If it is not required, the tree can be further contracted by eliminating the 
null leaves. The left-pointing exit on step 4 also indicates an illegitimate argu¬ 
ment x , but one of insufficient dimension for the particular path specified. 

The biunique key transformation provided by Program 4.8 is very 
effective when applied to a set whose dimension is small compared to the 
spread of the transformation produced by taking the base b value of the 
representation as, for example, in a glossary of English words.* A dis¬ 
advantage of the process is the need to revise the entire leaf list matrix 
when additions to or changes in the argument set occur. The process can 

* See, for example, Lamb and Jacobsen (1961). 
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Legend 

Program 4.8 Biunique transformation on key represented by x , using the leaf 
list of the contracted tree of key set k. 

be modified to produce a simpler but nonunique transformation by 
contracting the tree further so that some or all of the remaining leaves each 
represent two or more significant leaves of the original tree. 

Nonunique key transformations 

Although it is frequently impossible to obtain a sufficiently simple 
biunique transformation having a sufficiently small spread, it is always 
possible to produce a simple key transformation of arbitrarily small spread 
if the requirement of uniqueness is dropped. For example, the spread of 
the key transformation 

j — L((6e) _L p(*)) -r d\ 

varies inversely with d, but the transformation is usually nonunique for 
d> 1. 

If a key transformation is not unique, the ranking must be completed by 
a scan of one of the equivalence classes which it defines. The scan of each 
of the equivalence classes e j may, in general, be either directed or con¬ 
trolled, and the individual subsets may be ordered by frequency of occur¬ 
rence, by the base b value of their representations, or by some externally 
imposed (e.g., chronological) order. If a chained representation or file is 
used for each subset, a directed scan is normally used. 

The expected length of a directed scan of each of the equivalence classes 
e j may be computed and weighted by the relative frequency of the class to 
yield an expected over-all scan length. If the distribution of arguments is 
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uniform, the expected scan length for e j is given by 

(v(e j ) +1)t2 = (c, + 1)t2, 

where c is the coalescence of the transformation. Moreover, the relative 
frequency of arguments from e j is c j — (+/c). Consequently, the over-all 
expected scan length / is given by 

/= K c + c . ) i c ] = r 1 + £if 

2( + / c ) L c J € 

For a fixed dimension of the derived set d (and hence of c), and for a 
necessarily fixed value of +/c = v(fc), the value of / is clearly minimized 
if the coalescence vector c is uniform, i.e., if the components of c are all 
equal. Hence the expected scan length is minimized by a key transforma¬ 
tion whose equivalence classes are of equal dimension. 

A given key transformation is frequently employed to rank a variety of 
subsets of its domain k rather than k itself. For example, if k is the set of 
English words in a given dictionary, then one of the subsets to be ranked 
may be the set of distinct words in a particular sample of English text. If 
a particular subset of k is specified, then the coalescence of the key trans¬ 
formation in the specified subset x can be determined, and the transforma¬ 
tion can be chosen accordingly. More generally (as in the case of samples 
of English text), the active domain x may be only partially specified. The 
transformation should then be chosen so that its coalescence is nearly 
uniform for the expected active domains. If, for example, k is the set of 
all five-letter sequences, and if each active domain is the set of five-letter 
sequences beginning with a specified letter, then the key transformation 
used should depend only on the last four letters. If the set of derived keys 
produced by a key transformation has a spread s and a random uniform 
distribution within that spread, then the expected length of a scan (of the 
equivalence classes) can be shown* to be 1 + v(k) ~ 2s. 

Scanning of the equivalence classes. If an element x is to be ranked in k by 
a scan of k itself, no auxiliary information is required since the rank of 
component k j is simply its index j. If some permutation of k is used 
instead, then an auxiliary ranking vector r (i.e., a permutation vector) 
must provide the rank in the given set k. Specifically, if y = pjfc is used 
for ranking, then r is the permutation vector inverse to p, and the rank of 
element y j is r, r Finally, if the vector y is itself to be scanned in some 
prescribed order other than simple cyclic order, the order may be repre¬ 
sented by a chaining vector q. 

The vectors y and r or y, r, and q can be combined into a two-column 



See Johnson (1961) and Exercise 4.4. 
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matrix S or a three-column matrix C which contains all information 
requisite to the ranking operation. More generally, a collection of mat¬ 
rices F , V, etc., can be used, each representing some subset of the given key 
set k. 

The method of scanning the equivalence classes defined by a given key 
transformation is largely determined by the type of matrix or matrices used 
to represent the key set k. The five major methods of practical interest are 
enumerated below. Each is illustrated in Fig. 4.9. A 1-origin key trans¬ 
formation t is assumed. 

(a) Overflow. A two-column matrix F of column dimension s = d v 
represents the first elements of the equivalence classes (and their ranks), as 
follows: 

= «i \) * I V(d)), 

Fi =°,H d. 

All remaining elements of the sets (i.e., a l /e l ) are represented in arbitrary 
order in a two-column “overflow” matrix V. 

The scan procedure is given by the Program of Fig. 4.9 a. If the given 
argument x is not equal to F^ x \ then the overflow matrix V is scanned in 
ascending order. The left-pointing exit indicates that x $ k. 

For a uniform distribution, the expected scan length is clearly given by 

/ = 1 + WY) + 1) x + 2 v{k\ 

where = v(k) — v(d). The expected scan length is therefore large 
unless the average dimension of the equivalence classes (that is, v(k ) 4- v(d)) 
is close to unity. For a known nonuniform distribution, the expected scan 
can be reduced by placing the most frequent element of each equivalence 
class in F and ordering the elements in V according to their frequency. 

(b) Overflow with chaining. The two-column matrices F and V used in 
method (a) can each be augmented by a third column chaining vector 
which chains each equivalence class. Thus F$* is the row index in V of 
element ef if it exists, and is otherwise null. Similarly, if Vf = ef then 
Vf is the row index in V of e] fl if it exists, and is otherwise null. The 
program is given in Fig. 4.9 b. The expected scan length for a uniform 
distribution can, as shown earlier, be expressed in terms of the coalescence 
vector c as follows: 



(c) Single table with chaining. In the overflow methods [(a) and (b)], 
certain rows of the matrix F go unused, and a saving in storage can be 
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effected by combining the three-column matrices F and V in a single table 
T of column dimension 

p(T) = max (s, v(k )), where s — d v = ju(F). 

Let u be a logical vector such that u)}}{p{F)) = d. Then u//F constitutes 
the unused and u//F the used rows of F. Let v be a vector of dimension 
ju(T) obtained by appending a zero suffix to u. The first two columns of 
T are then defined as follows: 

«’//(«*/T) = «//(« 2 /F), 

«//(« 2 /T) = «*/(F»), 

where p =2 i 1 (//( T7 )) and r(p) = +/r. (The vector p permits an arbitrary 
reordering of the rows of V). The third column of T is a chaining vector 
which chains each of the equivalence classes e\ 

The appropriate scan program (Fig. 4.9c) is similar to that of method 
(b), and the expected scan length is identical. The serious disadvantage of 
the method lies in the construction of the matrix T —all of the rows 
(specified by v) required for the leading elements of the equivalence classes 
must be known before any of the nonleading elements can be allocated. 
The table T is therefore usually constructed in two passes over the given 
key set. Moreover, any addition to, or change in, the active key set k 
which introduces a new equivalence class may occasion reallocation of 
some row of T . 

(d) Single table with chaining and mapping vector. The main deficiency 
remarked in method (c) is occasioned by the fixed vector v and the fixed 
order of the rows of v//T, both imposed by the given key transformation t. 
The difficulty can be shifted from the matrix T to a mapping vector m 
which is used (as indicated in the program of Fig. 4.9 d) to effect a further 
transformation of the index / = t{x). The rows of T may then be arranged 
in any desired order, provided only that for each h e 1 1 (v(rf)), = j, 

where e / = k k = T x \ and t(e x h ) = /. Moreover, if T 1 = k , then the 
ranking vector T 2 may be omitted. 

Except for the extra step occasioned by the operation j <— m h the 
expected scan length is again the same as for method (b). Flowever, the 
requirement that /u(T) > max (s, v(k)) may now be relaxed to the form 
KT) > v(k), whereas v(m) must equal or exceed 5 1 . Since the squared 
length of the coalescence vector (that is, c + c) can, in general, be reduced 
by increasing the spread s of the transformation /, the expected scan length 
can now be reduced at the cost of increasing the dimension of the mapping 
vector m rather than at the (usually much higher) cost of increasing the 
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column dimension of T. A similar advantage can be gained by employing 
the mapping vector m in methods (a), (b), and (c). 

(e) Open addressing system .* The open addressing system employs a 
single table T but requires neither the chaining vector T 3 nor the mapping 
vector m. As shown by the program of Fig. 4.9e, each argument x is 
obtained by a forward scan of 7\, beginning at component T^ x) . Since 
the scan is cyclic, it is necessarily successful. It can also be made fairly 
efficient by constructing T as follows. The matrix is first specified as a 
matrix of nulls. The elements of k are then assigned in order, element k j 
being assigned to the first unassigned row following row t(k 3 ) — 1. 

The program of Fig. 4.9e describes both the construction and the use of 
the table T. The branch on step 3 can occur only if the element x has not 
yet been entered in the matrix T, and steps 5 and 6 then complete its entry 
and the specification of the corresponding component of the ranking 
vector T 2 . The use of T can, in fact, proceed concurrently with its con¬ 
struction, i.e., each argument x presented to the program defines a new 
entry in T if it has not previously occurred, the k index of x being deter¬ 
mined by some algorithm independent of T. 

If the active argument set k is not fixed, it may be desired either to add 
new elements or to respecify the rank of some element already defined in 
T. Respecification may be incorporated by allowing the scan used in 
defining an entry in T to terminate on encountering either the null element 
or the argument itself. Although respecification of an entry may be 
allowed, deletion of an entry and its replacement by the null element cannot, 
for the occurrence of such an inserted null element between the beginning 
point i = t(kj) and the point at which kj is entered in T would later cause 
an erroneous indication that kj was not defined in T. Replacement of a 
deleted entry by a special “deletion character” distinct from the null 
element could, however, be used. 

The expected scan length in the open addressing system exceeds that for 
method (d), since the expected length of scan of each equivalence class is 
increased by the potential interleaving of elements from different equiva¬ 
lence classes. Thus, in the example of Fig. 4.9e, the expected scan lengths 
for each of the equivalence classes (Sunday, Monday, Saturday), (Tuesday, 
Thursday), (Wednesday), and (Friday) are (1 + 2 + 6)/3, (2 + 3)/2, 1, 
and 1, respectively, yielding an over-all expected scan length of 16/7. The 
corresponding scan lengths for a chained scan (e.g., method (d)) are 
(1 + 2 + 3)/3, (1 + 2)/2 , 1 and 1, with an over-all expected scan length of 
11/7. However, since it uses a fixed scan, the open addressing system is 
better suited to a serial store than is the chained system. 

* The open addressing system appears to have been first used by A. L. Samuel, G. M. 
Amdahl, and E. Boehm in constructing address tables for an assembly program. 
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If the derived keys are uniformly distributed in the range 1 to p{T) 
then, as shown by Schay and Spruth (1961), the expected scan length is 
1 + p -T- 2(1 — p ), where p = v(k) 4- //(J). For a nonuniform distribu¬ 
tion, the expected scan length can be reduced by allocating the most 
frequent elements first, i.e., by defining T from the set k reordered in 
descending order on frequency. 

Bucket files. In certain files the locations divide naturally into blocks or 
buckets of n successive locations each, such that the entire contents of any 
bucket can be scanned in virtually the same time required to scan any one 
location in the bucket. Such a file is called a bucket file (Peterson, 1957). 
In a magnetic disc file, for example, each track forms a bucket. Each of the 
foregoing methods of scanning equivalence classes can be adapted to suit 
the characteristics of a bucket file. The equivalence classes can be 
grouped in buckets, with chaining provided only from bucket to bucket. 

Clustering. * The active argument sets of interest may be relatively small 
subsets of the complete set k. Moreover, their elements commonly share 
some characteristic so that a key transformation which gives uniform 
coalescence and uniform spacing of the derived keys with respect to k 
may yield highly nonuniform coalescence or nonuniform spacing, or both, 
with respect to a given active domain x. This effect is called clustering. If, 
for example, each element of k is represented by a vector of decimal digits 
of dimension ten, then the key transformation 

t(x) = [((10c) _|_ p(z)) -1- 10 7 J 

yields a mapping onto the range i°(10 3 ) which has both uniform coalescence 
and uniform spacing. On the active domain x, whose elements are all 
represented by vectors p(x) such that a 3 /p(z) = (2, 4, 7), however, all 
elements “cluster” in the single derived key 247. 

The deleterious effects of such correlations among elements of the 
active domain can be reduced by employing key transformations which 
depend on all components of the representation and do so in a manner 
which shows no systematic relationship to the structure of the representa¬ 
tion. The mid-square method , for example, consists in squaring the given 
key and extracting the middle digits of the resulting product. A commonly 
used transformation is the taking of residues modulo some number m 
such that m > v(x) and is either prime or contains few factors. 

* Note added in proof: M. Hanan and F. P. Palermo offer an important solution 
to clustering by the application of Bose-Chaudhuri codes. R. T. Chien and C. V. 
Freiman have remarked a similar application of Fire codes (private communications). 
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4.3 MULTIPLE KEYS* 

If in some mapping operation the access to both the key set and the set 
of correspondents is serial (or partially serial), considerable advantage can 
be gained by replacing the vector T 2 of ranks by the suitably reordered set 
of correspondents, that is, T 2 i becomes the correspondent of the key Tf. 
For, the ranking operation on the argument kj which gives access to the 
element Tf = k j also gives immediate access to the correspondent Tj in 
the same row T\ This is equivalent to eliminating the permutation 
operation (through reordering of the set of correspondents) and coalescing 
the ranking and selection phases so that together they require a single 
access to the (partially) serial memory. 

For a single functional correspondence, the coalescing of the ranking 
and selection phases can (by a suitable ordering of T) be accomplished by 
the single-table process (Fig. 4.9c) without introducing the mapping 
vector m of process (d). Frequently, however, a number of related func¬ 
tional correspondences must be provided between pairs of a family of 
vectors Oj so ordered that Q* corresponds to Qf for all i,j, and k. In an 
accounting system, for example, Q v 0 2 , 0 3 , and Q 4 might be, respectively, 
the vector of account numbers, names, addresses, and balances in a given 
ledger. Those vectors which may occur as arguments in a mapping process 
are called key vectors; those which never occur as arguments are called 
satellite vectors. 

Q may be reordered (and augmented by a suitable chaining vector) so as 
to permit the use of the program of Fig. 4.9c for some selected key set Q { . 
However, for any other key set Q j9 the order will, in general, be unsuitable. 
The program of Fig. 4.9 d may, however, be used together with an appro¬ 
priate mapping vector m? and chaining vector q j . For the sake of uni¬ 
formity and the advantage of allowing an arbitrary ordering for 0 , the 
distinguished key set Q { may also be provided with a mapping vector m* 
and treated like the rest. 

The generalized binary search of Program 4.4 can be applied to the case 
of multiple keys by providing a pair of chaining vectors (M 2 and M 3 ) for 
each key. The open addressing system is clearly unsuited to multiple keys. 
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EXERCISES 

4.1 Give a formal proof of the fact that the binary search of Program 4.3 will 
not work properly if steps 5 and 6 are replaced by the statements k <-j and i <- j, 
respectively. 

4.2 (a) The argument in the set of keys k = (k lf fc 2 ,. • . , K) occurs with the 

relative unnormalized frequency /'. For the case v(k) = 10, design the 
matrix M which will minimize the expected scan length when applying 
Program 4.4. 

(b) Show how additional keys may be incorporated in the system of part (a) 
without revising the entire matrix M. Discuss the effects on the expected 
scan length. 

4.3 Consider the ledger L defined as 
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and the argument domains k 1 , k 2 , and fe 3 , consisting of all 4-digit decimal numbers 
(account numbers), all 12-letter names (in capitals, with null fill), and all 14- 
character addresses (alphanumeric with null fill), respectively, and let the rows of 
<x 4 /L, (4 l a 12 )/L, and o> 14 /L represent the corresponding active domains x\ x 2 , 
and tf 3 . 

(a) Specify a simple key transformation on the set k 1 whose range lies in the 

set which, when applied to the active key set x 1 , yields a derived 

key set of dimension three or greater. 

(b) Reorder the ledger L and add a chaining vector to chain the equivalence 
classes so that the resulting matrix M may be used with the key trans¬ 
formation of part (a) and a program of the type of Program 4.9c. Show 
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the resulting matrix M and the specific program used for selecting the row 
M* determined by an argument a e x 1 . 

(c) Specify simple key transformations on each of the sets k 2 and fe 3 which 
yield derived sets of dimension not less than three when applied to the 
active domains x 2 and x s , respectively. 

(d) Augment the matrix M of part (b) by a permutation vector and a chaining 
vector suited to each of the key transformations of part (c). 

(e) Write a program which selects, as a function of a and /, the row of M 
corresponding to the argument a e x\ for j = 1, 2, or 3. 

4.4 Let t i = /(fe 2 ), where t is a key transformation such that t c= i\v(k)). The 
vector t takes on n n distinct values (where n = v(k )), which are assumed to be 
equiprobable. For n = 2, the cases are (1,1), (1,2), (2, 1), and (2,2), with 
expected scan lengths 1, f, 1, and f. 

(a) Show that the over-all expected scan length is 1.25 for n = 2. 

(b) Calculate the expected scan lengths for n = 3 and for n = 4. 

(c) Generalize the result of part (b) to show that the expected scan length 
rapidly approaches 1.5. [See Johnson (1961) for an alternative derivation.] 

4.5 Design an open addressing system for the ledger L of Exercise 4.3 and the 
key transformation of part (a). 

4.6 Program and discuss the extension of binary search to ra-way search. 

4.7 Let li be the time required to scan over the /th element of a vector x which 
is represented in a chained (or other serially-scanned) representation, and let 
be the frequency of occurrence of the argument x i% Discuss the role of the 
“standardized frequency” s =f -r l in determining the optimum ordering of the 
vector x. 

4.8 The neighbors problem requires that the near neighbors of an object in 
^-dimensional space be found. The technique used depends on the dimensional¬ 
ity and the particular criteria of adjacency. 

(a) The position of a vehicle on a turnpike is represented by the distance in 
miles of the vehicle from the south end, and p { is the coordinate of the /th 
patrol car. Write programs to: 

(i) identify the patrol car nearest an accident at position a, 

(ii) identify the two cars nearest to each other. 

(b) A three-column matrix V specifies the locations and radio call-signs of a 
fleet of merchant vessels on a flat lake, where VJ is the call-sign of the /th 
vessel, V 2 i is its distance in miles from the meridian tangent to the lake on 
the west, V 3 * is its distance in miles from the perpendicular tangent to the 
lake on the south. Write a program to determine the call-sign of the neigh¬ 
bor nearest to a distressed vessel whose call-sign c is given. 

(c) The matrix of part (b) is used to specify call-signs and locations at time t 
of a fleet of bombers over a flat territory. When each bomb is released, 
neighboring planes must be at a safe distance. Construct a program which 
will find the call-signs of ail pairs of planes within r miles of each other at 
time t. 

(d) In a certain hydrodynamic calculation, the motion of each elementary 
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volume of fluid is traced. The ith elementary volume is described by row L 1 
of a matrix: (4 [ a 3 )/L represents the three space coordinates (in a recti¬ 
linear system), and a 7 /L represents the remaining parameters. At each 
time step the parameters of volume i are redetermined by those of the four 
elements nearest it. Write a program to determine a 4 /!/ as the set of 
indices of the four nearest neighbors of element i. Hint. Attach an explicit 
index vector before sorting. 
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5 

METAPROGRAMS 


It is frequently necessary to treat a program as an argument of some 
process, as in the systematic analysis of a faulty program or in the trans¬ 
lation of a program expressed in one language to an equivalent program 
expressed in a second language. Such a process defined on programs may 
itself be formalized as a mctaprogram. 

Formally, a metaprogram is defined as a program whose domain is a set 
of programs, each element of the domain being called an argument 
program . If the range of a metaprogram is also a set of programs, the 
metaprogram is called a translator. An element of the range of a translator 
is called a function program ; i.e., a translator operates on argument 
programs to produce function programs. A metaprogram whose range is 
not a set of programs is called an analyzer. Thus an analyzer produces, not 
a function program, but data useful in the analysis or application of the 
argument program. If, for example, the instructions of a computer 
program are sorted on the data address and listed, the list brings together 
all data references to each register used and therefore facilitates analysis of 
the (possibly conflicting) uses of registers in the program. A metaprogram 
which schedules and directs the execution of other programs (and meta¬ 
programs) is called a director or supervisor. 

Four main types of translator are distinguished: compilers, assemblers, 
generators, and interpreters. A compiler accepts programs expressed in a 
given language (the argument language ) and produces corresponding 
programs expressed in a second language (the function language). 

An assembler is a special case of a compiler which is limited as follows: 
(1) the statements of the argument program are virtually independent and 
may therefore be treated one at a time, and (2) the statements of the 
argument program are simple (not compound) and need not be analyzed 
into component statements. There usually exists a fixed correspondence 
between the operations of the argument program and those of the function 
program; the translation thus consists essentially of a substitution of 
symbols for the operations and/or operands. 

A generator produces, by specialization, any one of a family of function 
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programs. Thus a single generator might, for each specified value of n , 
produce a specialized program for evaluating the function x n . The 
argument of a generator is usually viewed as two distinct parts; the 
skeleton program , which determines the family of potential function 
programs, and the specification , which determines the particular member 
of the family produced. Generators are frequently incorporated in 
compilers. 

A translator is called an interpreter if it (1) executes the segment of 
function program corresponding to a statement of the argument program 
immediately after it is produced, and (2) selects the statements of the 
argument program in a sequence determined by the execution of the 
function program. The function program itself is normally treated as an 
intermediate result, and only the outputs of the executed argument program 
are retained. The execution of an interpreter can therefore be viewed as 
follows: each statement of the argument program is first “interpreted” 
in the function language and is then executed. 

The trace program and the utility program are special cases of the inter¬ 
preter. The former executes the argument program without modification 
but produces, in addition to the normal outputs of the argument program, 
a trace of the argument program listing each instruction executed and the 
intermediate results produced. In a narrow sense, a utility program is an 
interpreter whose argument program is supplied directly by a computer 
operator via the control console. More broadly, the term is used to denote 
any program frequently used by the computer operator. 

The present discussion of metaprograms will be limited to one important 
aspect of compilers, the analysis of compound statements occurring in the 
argument program. The treatment embraces the translation between the 
common parenthesis notation and the Lukasiewicz (1951) notation which 
proves most convenient in the analysis of compound statements. 

5.1 COMPOUND STATEMENTS 

Each statement of a program can be considered as an operator which 
maps the given operands into the specified result. If the operator corre¬ 
sponding to a given statement belongs to a given set of operators p, the 
statement is said to be elementary in p. A finite program whose operators 
all belong to a set p is called a program in p. A statement which is not 
elementary in p but which can be expressed as a finite program in p is said 
to be compound in p. The analysis in p of a compound statement is the 
specification of a corresponding finite program in p. 

For example, the statement 

z<-{x + y) x r + (s - t) n 
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is compound in the operator set 

p = (addition, subtraction, multiplication, exponentiation), 

and Program 5.1a shows one possible analysis in p. Program 5.16 shows a 
similar analysis in the set 

q = (addition, subtraction, multiplication, branch). 

A metaprogram that translates all statements which are elementary in a 
set of operations p can be extended to translate statements which are 



Analysis in p Analysis in q 

(a) (6) 

Program 5.1 Analysis of the compound statement z (x + y) x r + (s — t) n 

compound in p by the addition of a metaprogram for analyzing compound 
statements. The conventions adopted for the representation of compound 
statements must, of course, be complete and precise so that interpretation 
is unequivocal. These conventions should be familiar and convenient to 
the programmer and should also permit easy analysis by a metaprogram. 
The common parenthesis notation of elementary algebra is congenial to 
programmers, whereas statements in Lukasiewicz notation are easier to 
translate and evaluate, easier to transform to an optimum form which 
minimizes the amount of intermediate data storage and execution time 
required in their evaluation or translation, and possess the simple criterion 
for well formation developed for the left list matrix of a tree in Sec. 1.23. 
The analysis of compound statements will therefore be discussed in terms 
of Lukasiewicz notation, and algorithms for translating between the 
parenthesis and Lukasiewicz notations will be presented. 
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Figure 5.2 Representations of a compound statement 
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Figure 5.2 shows several alternative representations of a compound 
statement. The tree representation (5.2 a) is perhaps the most graphic, and 
the other forms may be viewed as alternative representations of the tree. 
The common parenthesis form of Fig. 5.2e, for example, specifies the 
requisite structure primarily by grouping the nodes of each subtree within 
parentheses with the root in the middle of the group. As shown in Sec. 
3.4, the left list matrix of Fig. 5.2c can be supplanted by the simpler left 
list vector of Fig. 5.2 d, providing that the degree of each operator p is a 
known function d(p). 

The left list vector notation for a compound statement is also called 
Lukasiewicz * Polish , or parenthesis-free notation. The Lukasiewicz and 
the parenthesis notations will hereafter be referred to as ^-notation and 
^-notation, respectively. 

5.2 LUKASIEWICZ NOTATION 

Although ^-notation can be viewed as the left list vector of a tree 
representing a compound statement, it is helpful to develop an alternative 
equivalent formulation as follows. Let / and p be two disjoint sets whose 
elements are literals and operators, respectively, and whose union v = 
/ ® p is called a vocabulary. A strictly positive integral degree function 
d{p) is defined on each element of p, and each operator p of degree d 
accepts d elements of l as operands to define a result or value in /. In 
symbolic logic, for example, / = (0, 1), p = (A, V, <5(A) = <5( v) = 
2, and 6( ) = 1. Consistent with these notions, the degree of each literal 
is defined to be zero.t 

Each operator p of degree v(q) defines a function (i.e., a mapping) from 
each vector q c: / into an element y cl. This function is denoted by the 
vector /=(/?)© q. The vector / is called an JS ?-phrase of length v(f ), and 
the element y is called its value. Table 5.3 shows, for example, the re¬ 
phrases in the system for symbolic logic based on and , or , and not. The 
vector / is clearly contained in the vocabulary i>, that is, / c v. Where its 
omission raises no ambiguity, the prefix will be dropped from the term 
“.e-phrase” and from similar terms to be defined. 

A vector z c= v is called an i (^-formula of length v(z). In particular, 
every phrase is a formula. The degree vector of a formula z will be denoted 
by d(z) and defined by the relation (d(z)) { = S(z i ). 

* First proposed by Lukasiewicz (1951) and first analyzed by Burks et al. (1954). 

t The system is extended to include variables as well as literals by considering the 
vocabulary v = x 0 l ® p, where x is the set of variables, and x n (/ © p) = c(0). 
The degree of each variable is, like that of a literal, defined as zero. The domain of the 
operators is still confined to the set /, and in any legitimate algorithm each variable is 
specified as a literal by some statement before it enters as an argument. 
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Phrase 

Value 

( V, o, 0) 

0 

(V.o, 1) 

1 

(V, 1,0) 

1 

(V, 1,1) 

1 

(A , 0, 0) 

0 

(A, 0,1) 

0 

(A, 1,0) 

0 

(A, 1,1) 

1 

(-,0) 

1 

(-1) 

0 


Table 5.3 Phrases in ^-system for symbolic 
logic based on operators and , or , not 

If some infix of z is a phrase, and if the infix is replaced by its value, then 
the resulting vector y is called an ^-reduction of z. If y is any ^-reduction 
of z which cannot be further reduced, it is called a complete reduction of z 
or, since it can be shown to be unique, the complete reduction of z. Com¬ 
plete reduction of z will be denoted by J&(z). A formula z is said to be 
singular* if its complete reduction is a single literal, i.e., if v(J?(z)) = 1 and 
JS?( Z) € l. Thus q = (A, 1, V, 0, 1) and r = (1) are singular, but 5 = 
(A , 1, 1, 0) and t = (a) are not. 

For example, complete reduction of the singular formula z = (a , V, 1, 
A , 0, 1, V, 1, — , 1) may be performed as follows: 

z = (A, V, 1, A , 0, 1, V, 1, ^\_1) 

z 1 = (A, V, 1, A, 0, 1, V, 1, 0) 

z 2 = (A, V, UAJKI, 1) 

z 3 = (A,V 1 h0,l) 

* 4 = (a,M) 

z 5 = 1 

Program 5.4 shows the complete reduction of a formula z, including tests 
of singularity. 

Program 5.4. The components of the given formula z are examined in reverse 
order and assembled into a stack vector y = ( z h z i+1 , .. ., z v ), where z i is the 

* The term well formed used by Burks et al. (1954) and others is avoided here because 
singularity implies not only that the formula represents a well formed tree but also that 
the tree is singular. 
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1 <— 

5 — (v(y) = 1) 

2 -» 

s<- 0 

3 

y - €(0) 

4 

i <-v(z) + 1 

5 


i <- i - 1 

6 


y — (Zf) ©y 

7 

= 

<5(yi) : 0 

8 


j — <5(yi) + l 

9 


./ : v (y) 

10 


X — ^(« 3 7y) 

11 


y — 0») © a’/y 


Program 5.4 


1-origin indexing 

z 

Given formula. 

y 

Reduced suffix of z. 

i 

Index of z in descending scan. 

j 

Dimension of phrase to be 
reduced. 

s 

Singularity indicator. 


Legend 


Evaluation of the formula z 


component currently examined. When an operator (i.e., a node of nonzero de¬ 
gree) is first encountered, the prefix of y forms a phrase of dimension / = d^) 4- 
1, which is immediately reduced (i.e., evaluated) on step 10 and is then replaced 
by its reduced value on step 1 1. Singularity of the vector z is indicated by a non¬ 
zero value of s, which is set to one only if the exit occurs from step 1 with v(y) = 1. 
The case v(y) > 1 can occur if the formula represents a well formed but nonsingu¬ 
lar tree, i.e., if the formula contains two or more singular formulas. The exit 
from step 9 occurs if the indicated dimension of any phrase exceeds the current 
dimension of y and leaves the indicator 5 at its initial zero value. 

The singular formulas are clearly the meaningful compound statements 
in the system. Moreover, if L 2 is a singular formula and if L x = <5(L 2 ), 
then L is the left list of a singular tree. The singularity of a given formula 
z can therefore be determined from its associated degree vector d = d(z). 
The necessary and sufficient condition for singularity of the associated tree 
is simply v(d) — (+/d) == 1. As shown in Sec. 1.23, the necessary and 
sufficient condition for well formation is that all components of the suffix 
dispersion vector s defined by* 

* = (/ + □):(€ - d) 

must be strictly positive. The maximum over the components of s will be 
called the maximum suffix dispersion of z. 

* The suffix dispersion vector describes the dispersion (number of roots) of all suffixes 
of z, as may be more easily seen from the alternative expression 

s i+ 1 = v(a j /d) - (+/a J '/d). 
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5.3 THE MINIMAX FORM OF AN ^-FORMULA 

Two formulas are said to be equivalent if they have the same value for 
each possible specification of their variables. If z is any formula whose 
operators are all symmetric (i.e., whose operands commute), then any 
reordering of the component singular formulas of z which leaves the span 
of each operator unchanged leads to an equivalent formula. For example, 
since the operators A and v are symmetric, the formulas 

* = (A, V, A, V, w, A, w, v, r,q, p) 
and q = (a ,/?, V, q, A, r, V, w, A , w, v) 

are equivalent, as may be easily verified. In the tree representation this 
reordering appears as a reordering of the group of subtrees rooted in a 
common node. 

A formula whose maximum suffix dispersion is minimal with respect to 
the set of all equivalent formulas is said to be in minimax form. 

The dimension of the stack vector y employed in the evaluation of a 
formula z (cf. Program 5.4) takes on successive values equal to the number 
of roots in the tree represented by the suffix of z currently scanned. It 
therefore assumes the values (in reverse order) of the components of the 
associated suffix dispersion vector s. The maximum dimension of the 
stack vector determines, in turn, the amount of auxiliary storage required 
in the evaluation of the formula or in the compilation of a function 
program for its evaluation. It also determines, in part, the number of 
transfers of intermediate results to and from storage in evaluating the 
formula in a computer having a limited number of central registers. A 
formula in minimax form minimizes the maximum dimension of the 
stack vector and is therefore to be preferred. 

The transformation of a singular formula z to equivalent minimax form 
is based on the following theorem: if each of the (5(2^) component singular 
formulas of a 1 /z is in minimax form, then the entire formula can be 
brought to minimax form by arranging the component singular formulas 
in ascending order on their maximum suffix dispersion. 

For example, if z = (a , V, A, a, b, A, c, d, v, e,f ), then <5(2q) == 2, 
and a}jz contains two singular formulas, y 1 = ( V, A ,a,b, A , c, d), and 
y 2 = (v, £,/*), each in minimax form and possessing maximum suffix 
dispersions of 3 and 2, respectively. Moreover, 

< 1 = (*i) ®y 2 @y 1 = ( A, V, e,f v, A, a, b, A, c, d) 

is an equivalent formula in minimax form, with a maximum suffix dis¬ 
persion of 3 as compared with a value of 4 for the same function of z. 
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To establish the theorem, let z be any singular formula, let r(z) be its 
dispersion (that is, the number of roots), let 5 {z) be the maximum suffix 
dispersion of z, let d = <5^), and let 

« 7 * = y 1 © y 2 © • • • © y 1 

be the unique (cf. Program 3.9) partitioning of a l jz into its component 
singular formulas. Then y 1 © y 7+1 © • • • @ y^- 1 represents a Ar-tuply 
rooted tree and r(y i © y i+1 © * * * © y +A:_1 ) = k. Moreover, 

s(a}lz) = max [s(/) + r(y 2 ® y 3 © • • • © y d ), s(;y 2 © y 3 © • • • © /)] 

= max OO 1 ) + d — 1, s(y 2 © y 3 © • • • © y*)] 

= max [s(y) + d — 1, s(y 2 ) + d — 2, s(;y 3 © • • • © /')] 


= max [s(y) + d —./] 

3 = 1 

= d + max [s(y') - j]. 

5 = 1 

Since the component formulas are in minimax form, the $(y) are individ¬ 
ually minimal, and the maximum over s(y j ) — j is clearly minimized by 
arranging the s(y j ) in ascending order. This concludes the proof. 

To ensure that each component formula is itself optimal, it suffices to 
apply this reordering procedure in turn to the successive singular formulas 
encountered in scanning the given formula from right to left, as shown in 
Program 5.5. 

Program 5.5. The vector y is the suffix a { /z permuted to optimal form, p is its 
partition vector* (that is, ((p + a i_1 ) [ a p j)ly is the jth singular formula of y ), and 
g is its maximum suffix dispersion vector (that is, gj is the maximum suffix dis¬ 
persion of theyth singular formula of y). The main control parameter / is decre¬ 
mented on step 11, and, if it is not zero, the degree d = <5(s ? ) of the next 
component to be added to y is examined. If z t is not an operator, the branch 
to step 8 occurs with h = 1. The component is then a formula of length 1 
and steps 8-10 add it to y and make the appropriate changes in p and g. 

If z t is an operator (of degree d ), the loop 15-22 scans the vector g and reorders 
the first d component formulas of y so that their maximum suffix dispersions are 
brought to ascending order. This is accomplished by the simple, but not necessar¬ 
ily efficient, sorting process of comparing successive pairs of adjacent components 
of g and interchanging the corresponding component formulas of y (by rotation 

* The conventions used for p are those established in the subtree partitioning of 
Program 3.9. 
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5 

6 

7 

8 
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11 
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13 
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16 

17 

18 

19 

20 
21 
22 


-> s 0 
^ €(0) 
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P €(0) 
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+ f +- OL d /g - l 1 ^) 

h <-d + («/)//)! 
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P ((P x ad ) + 1) © * d /P 
;y «- (**) © ;y 


5 (KP) = 1) 
d +- d( Zi ) 

d : v(p) 
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h +-h + 1 
h : d 
gh • gh+l 

u <-(p + a A_1 ) j a^+^+i 
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gh*~>gh+l 
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1-origin indexing 

z 

Given formula. 

y 

The transformed suffix 
a i /z. 

p 

Partition vector of y; 
Pj is the dimension of 
the yth singular form¬ 
ula of y. 

g> 

Maximum suffix dis¬ 


persion of the yth 
singular formula of y. 

d 

Degree of z it 

s 

Singularity indicator. 


Legend 


Program 5.5 Transformation of the formula z to minimax form 


of the infix representing the pair) if an interchange is required. Steps 21 and 22 
effect the corresponding interchanges in the vectors p and g. The loop is termi¬ 
nated by the branch from step 17 to step 6, the first d formulas of y (forming the 
prefix a n /y, where n = ( +/a d /p)) are then in ascending order on their maximum 
suffix dispersions, and the new formula ( z © a n /y is therefore in optimal form. 
Its maximum suffix dispersion is computed by steps 6 and 7 and replaces the 
prefix a d /g (step 8) so that g becomes the maximum suffix dispersion vector of 
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( z d © y> The partition vector p is respecified by step 9. The one component of g 
remaining at the conclusion is the maximum suffix dispersion of the optimized 
statement y. 

The minimax form of a formula is, in general, not unique. It can be 
made unique, however, by using some assigned orderings of the operators, 
literals, and variables (e.g., the ordering specified by the vocabulary 
v — x © / © p) as a minor category in the reordering of component 
formulas. Such a unique form is helpful in detecting the occurrence of 
equivalent compound statements within a formula, with the aim of 
obviating repeated segments in a corresponding function program. 


5.4 TRANSLATION FROM COMPLETE 
PARENTHESIS TO LUKASIEWICZ 
NOTATION 


Ordinary parenthesis notation is complicated by the occasional or 
consistent use of certain conventions for eliding parentheses. For example, 
the expression 

(x + (y x z )) 


may also be written as 


(x + y x z) 


by the convention that multiplication takes precedence over addition, or as 


x + y x 2 , 

with the understanding that the entire expression need not be enclosed in 
parentheses. 

The problem posed by the use of such conventions can be segregated by 
considering a complete parenthesis notation in which all implied paren¬ 
theses are included, i.e., in which each operator and its associated operands 
are enclosed in parentheses. The analysis of a statement in parenthesis 
notation can therefore be performed in two steps, a translation to com¬ 
plete parenthesis notation according to the prescribed conventions, fol¬ 
lowed by the analysis of the resulting statement. The present discussion 
will be limited to expressions in complete parenthesis form. 

The complete parenthesis notation will be referred to as ^-notation and 
the terminology adopted for ~S?-notation will be extended analogously. 
Thus, z == ([, [, x, +, y, ], x, r, ]) is a ^-formula more commonly 
denoted by (x + y) x r. To avoid confusion with the normal use of 
parentheses (e.g., in enclosing vectors), brackets will be used (as in the 
foregoing vector z) to represent the ^-notation. 
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The discussion will be limited to a system of unary and binary operations 
only (i.e., d(p) = 1 or 2) and will again be illustrated by the system of 
logical operations. Assuming a vocabulary of the form v = X@l@p® 
([, ]), the rules of composition for ^-notation may be formulated as 
follows: 

1. If v e x @ /, then v is a singular ^-formula. 

2. If z is any singular formula, if u e p, and if d(u) = 1, then ([) © (w) © 
z © (]) is a singular ^-formula. 

3. If y and z are both singular formulas, if b e p, and if d(b) = 2, 
then ([)©>>© ( b ) ©'z © (]) is a singular ^-formula. 

In particular, ([, y , A , 2 , ]) is singular but (?/, A, z) is not; (y) is singular 
but ([, y , ]) is not, and ([, — , y, ]) is singular but (~, y) is not. 



Program 5.6 Translation from complete parenthesis to Lukasiewicz notation 


The translation from SP- to JS?-notation can be performed with the aid 
of one auxiliary file or stack vector. Program 5.6 shows a suitable process 
which will correctly translate any singular formula, but which includes no 
tests for singularity. It is noteworthy that all left parentheses are simply 
ignored. A similar process can, of course, be designed to use only the left 
parentheses and to ignore all right parentheses.* Any translation which 
tests for singularity clearly must use all parentheses. If the jSf- and 
notations employ different sets of operator symbols (e.g., A, V, and 
x, +, ~), the appropriate translation can easily be incorporated in the 
program. 

Program 5.6. The original statement is assumed to be recorded on a file 
with partitions \ following each symbol and with a null item and partition X 2 at 

* Oettinger (1960) analyzes three types of parenthesis notation: left , right, and com¬ 
plete. 
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the end. File 0 2 receives the resulting J^-formula in reverse order, i.e., from right 
to left. The stack file 0 3 receives each operator symbol as it is read from and 
transfers them one at a time in reverse order (i.e., by a backward read) to <I> 2 at 
each occurrence of a right parenthesis. A trace of the program shows, for 
example, that the ^-formula ([, [, x, V, 2/, ], A , [, — , ], ]) translates correctly 

into the J?-formula (#, 2 /, V, 2 , A) reversed from normal order. 

A partial test of singularity can be provided by testing each component 
for compatibility with its predecessor, the ordered pair being declared 
compatible if and only if it can occur in some singular formula. For 
example, an operator may be followed by either a left parenthesis or'a 
variable, but not by a right parenthesis or another operator. These first- 
order compatibility constraints can be expressed in terms of the following 
classes: left parenthesis, unary operator, binary operator, variable or 
literal, and right parenthesis. These classes will be denoted by [, w, b , v , and 
], or alternatively by 1,2, 3, 4, and 5. The constraints are summarized in 
the matrix M of Program 5.7. 

The test of singularity provided by the first-order constraints is not 
complete,* but can be completed by the following expedient.! The 
auxiliary file which receives the operators (file 0 3 of Program 5.7) also 
receives the left parentheses in their turn. The following tests are then 
added: 

1. Each operator is accepted and replaces the previous entry in the 
auxiliary file if and only if the previous entry is a left parenthesis. 

2. The transfer of one component from the auxiliary file to the output 
file normally occasioned by the appearance of a right parenthesis is 
accepted by the right parenthesis if and only if the component trans¬ 
ferred is an operator. 

3. The possible exhaustion of the auxiliary file is tested each time it is 
read. 

The first test prevents the acceptance of two successive operators without 
an intervening left parenthesis. At each application of the test, the corre¬ 
sponding left parenthesis is removed from the file. Since the auxiliary file 
may now contain left parentheses as well as operators, the second test is 
required to prevent their acceptance as operators. The complete testing 
and translation process is described by Program 5.7. 

Program 5.7. The current component x is read from the input file on step 8 and 
its class k is determined before the repetition of the main loop at step 5. Step 5 

* The tests provided in compilers have frequently been limited to essentially this type. 
See, for example, Carr (1959). 

t This procedure is due to Oettinger (1960). 
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Program 5.7 Translation from complete parenthesis to Lukasiewicz notation 
with full checking of singularity 


determines the first-order compatibility of k with its preceding value j. (The 
singularity indicator 5 is set to unity only at the exit on step 10.) Each component 
occasions the recording (step 6) of one item on one file—the auxiliary file 0 3 if 
k < 4, or the output file 0 2 if ^ > 4. The item recorded is the current component 
unless it is a right parenthesis. In the latter event, the variable x is first respecified 
(step 20) by a backward read from the stack file. The test on step 21 assures that 
the item read is an operator. 

If the current component is an operator, the previous item recorded on the 
auxiliary file must first be read, compared with “[,” and discarded. This occurs 
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on steps 15 and 16. The exits on steps 15 and 20 indicate nonsingularity due to 
the exhaustion of the stack file. Step 9 provides a final test to ensure that the 
stack file is exhausted when the input file becomes exhausted. Since the first 
component of any singular formula must be either a variable, literal, or left 
parenthesis, the initial setting of j to 2 on step 4 provides the appropriate initial 
compatibility condition. 

Each of the translation programs considered produces the resulting 
cS^-formula in reverse order. This is the order in which it is most easily 
evaluated and, consequently, the order in which the synthesis of a corre¬ 
sponding function program is most easily performed. Synthesis may 
therefore proceed concurrently with analysis. The analysis may, on the 
other hand, be completed first, and the output file 0 2 rewound before 
beginning the synthesis. The latter alternative allows the use of separate 
metaprograms for analysis and synthesis, and hence makes lesser demands 
for metaprogram storage. It also allows the application of an intervening 
transformation of the jSf-formula to some preferred equivalent form. 
However, as shown by Program 5.4, the transformation to minimax form 
also treats the JS^-formula in reverse order. It can therefore be performed 
concurrently with the translation from parenthesis notation. 

5.5 TRANSLATION FROM LUKASIEWICZ TO 
COMPLETE PARENTHESIS NOTATION 

The inverse translation from Lukasiewicz to complete parenthesis 
notation is, unlike the evaluation of the Lukasiewicz formula, best 
performed by a forward scan. The suffix dispersion criterion of singularity 
must then be applied in the following way. The dispersion of the entire 
statement is assumed to be one, and the dispersions of successively shorter 
suffixes are obtained by subtracting (1 — (5(a;)) for each succeeding com¬ 
ponent x. The suffix dispersion thus computed must reach zero when and 
only when the remaining suffix is null; if not, the statement is nonsingular. 
The translation of Program 5.8 provides complete checking of singularity. 

Program 5.8. The resulting ^-formula is produced on file ® 2 in reverse order. 
Each operator encountered is recorded in the auxiliary file together with a preced¬ 
ing left parenthesis, and it also causes a right parenthesis to be recorded in the 
output (steps 9-11). Each variable encountered is recorded (step 12) in the output 
file and initiates a transfer from the auxiliary file to the output file which termi¬ 
nates (step 19) only when an operator of degree two is encountered or (step 13) 
when the file becomes exhausted. In the latter event, steps 14 and 15 are executed 
as a final check on singularity—exhaustion of the stack file, exhaustion of 
the input file, and the first zero value of the suffix dispersion m must occur 
together. 
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1-origin indexing 

®i 

Input in ^-notation 
(terminated by °, X 2 ). 

O 2 

Output in ^-notation 
(reversed order). 


Auxiliary stack file. 

P 

Set of operators. 

m 

Suffix dispersion assuming 
total weight is unity. 

s 

Singularity indicator. 


Program 5.8 Translation from Lukasiewicz to complete parenthesis notation 

with complete test of singularity 
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EXERCISES 

5.1 For each of the following logical functions, exhibit an equivalent (i) tree, 
(ii) ^-formula, and (iii) ^-formula: 

(a) /O, y, z) = x a (y v 2 ). _ 

(b) g(w, x , ?/, 2 ) = (>v V (y ^ 2 )) a V (w = 2 /)). 

(c) the function of part (b), limiting the operators employed to and , or , and aw/. 

5.2 Let Z'( x , 2 /, 2 ) be a logical function of three variables, let q be an equivalent 
formula in ^-notation, and let r be an equivalent formula in ^-notation. Write 
programs to determine the intrinsic vector i( f) (cf. Sec. 7.2) as a function of 

(a) the ^-formula q. 

(b) the -^-formula r. 

5.3 Let a = (A, 0, v, 1, 0) 

b =(~, 0, V, 1,0) 
c = (A, V,0, V,J,0) 
d = (A, V,^, , t) 

e=(v,q,r f ,/) _ 

q = ([, [, q , V, r, ], A , , /, ], ]) 

r = ([,<?, V.r,], A , [, ,/,],]) 

Trace the operation of 

(a) Program 5.4 for each of the cases z = a, s = b, and z = c. 

(b) Program 5.5 for z = d. 

(c) Program 5.6 for Oj containing q. 

(d) Program 5.7 for containing q and for containing r. 

(e) Program 5.8 for 4^ containing d and for containing e. 

5.4 Write a program for translating from ^-notation to .^-notation which is 
analogous to Program 5.6 except that it ignores right rather than left parentheses. 

5.5 Write a program to extend the minimax transformation of Program 5.5 to 
the case of an operator set of the form p = p 1 © p 2 , where p 1 and p 2 are a set of 
symmetric and asymmetric operators, respectively. 

5.6 Write a program which extends the minimax transformation of Program 5.5 
to include ordering on the variables and operators so as to bring the formula to 
unique canonical form, as suggested in Sec. 5.3. 

5.7 Write a program which will recognize all identical singular subformulas 
occurring in a singular ^-formula z and which will produce a record of the 
associations in some convenient form. 



chapter ^ 

SORTING 


The order in which a set of items is arranged in a large-capacity store 
often has a marked effect on the simplicity and speed of execution of 
algorithms defined on them, and it therefore becomes necessary to sort or 
rearrange groups of items. 

The problem of sorting may be described as follows: given a vector a, 
determine the ordering vector p = 6/(k(a)) and the permuted vector 
c = pja, where k(a t ) is a numeric function defined on the components of 
a, and k(a) is the vector defined by ( k(a))j = k(a } ). The function k is 
called the key of the sorting process, and k(a ) is called the key vector 
associated with a. The key function is frequently an index in some set b, 
that is, k(a) = b i a. The components of a will also be called items \ since 
the vector a is commonly represented in a file, it will also be called a file. 

Most sorting processes determine the ordered vector c .= pja without 
explicitly determining the permutation vector p. A sorting process which 
explicitly determines and uses the permutation p is called an address table 
sort. 

Sorting processes fall into two major classes, called serial - or random- 
access , according to whether the files used to represent the original and the 
intermediate vectors produced are serial-access or not. Random-access 
processes are also called internal , for they are normally performed in the 
“internal” storage of a computer. Combinations of serial processes and 
internal processes are used, but the two types can and will be described, 
analyzed, and evaluated independently. 


Input 

Output 

Name of Process 

Single 

Single 

Duplication 

Single 

Multiple 

Classification 

Multiple 

Single 

Merging (or Merge) 

Multiple 

Multiple 

Revision 


Table 6.1 Types of file operations 
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Four types of operations on files are distinguished, according to whether 
one file or several files are used in input and in output. They are shown in 
Table 6.1. A classification (merge) involving m output (input) files is called 
an m-way classification (merge). 

6.1 SERIAL SORTING METHODS 

Copy operations 

A serial sorting process is executed as a sequence of copy operations. 
A copy operation is defined as follows: all items from a given set of input 
files are transferred to a given set of output files, and each item read from 
any input file must be transferred to some output file before a further item 
is read from the same input. 


Input Item 

Files Positions 



Output 

Files 



Figure 6.2 Copy operation 


Figure 6.2 provides a graphic description of the copy operation. The 
m components p v p 2 , . . . , p m denote storage for m items, the fcth input 
file may be read to the kt\\ item position only, and the occupant of any 
item position may be transferred to any one of the output files. Once an 
item is so transferred, the evacuated position may be refilled by the next 
item from the corresponding input file. Any copy operation can clearly be 
performed by serial input files and serial output files, with each item being 
read and recorded but once, and with no auxiliary repositioning of the files. 

A copy operation is also called rank-preserving , since it satisfies the 
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following definition of that term. [The concept of rank preservation and 
its use in the analysis of sorting processes were first introduced by 
Ashenhurst (1953).] Any item may be specified by a pair of coordinates 
(/, r ), the file coordinate f designating the file to which it belongs, and the 
rank coordinate r designating its rank (i.e., its index) in the file. Two items 
with initial coordinates (f l9 r x ) and (/ 2 , r 2 ) and with final coordinates (//, r x ) 
and (/ 2 ', r 2 ), respectively, are said to be relatable if and only if f x = / 2 and 
// = / 2 '. An operation is called rank-preserving if precedence relations 
are maintained for all relatable items, i.e., 

r x < r 2 or x < r 2 . 

Henceforth the terms merge and classification will, unless otherwise 
specified, refer to rank-preserving merge and rank-preserving classification, 
respectively. A merge in which each input file forms an infix of the output 
file is called a simple merge , and a classification in which each output file is 
formed from an infix of the input is called a simple classification. 

An m-way classification and a subsequent m-way merge together effect a 
rearrangement from a single file to a single file. If the classification and 
merge are both rank-preserving, the possible rearrangement effected is 
restricted. However, a sequence of such orderings using alternate classi¬ 
fication and merge can effect an arbitrary reordering. In particular, the 
following two important subclasses of such orderings will each be shown 
to suffice: 

1. simple classification and merge, 

2. classification and simple merge. 

A sequence of copy operations of the first type used to effect complete 
ordering on some key is also referred to as a merge sort. A sequence of the 
second type is called a column sort. 

Simple classification and merge 

An infix in a file vector for which the key is a monotone increasing 
(decreasing) function of the rank is called an increasing ( decreasing ) string. 
The length of a string is the number of items it contains, and a maximal 
string is a string contained in no longer string. A file containing a single 
maximal string is ordered on the key. 

For example, the sequence of keys 

1,3, 5, 8, 4, 7,9 

contains several increasing strings including 1, 3; 1, 3, 5, 8; and 7, 9, but 
it contains only two maximal increasing strings, 1, 3, 5, 8 and 4, 7, 9 and 
six maximal decreasing strings. Henceforth the term string will normally 
refer to a maximal string. 
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Two files, each containing one string, may be merged to produce a 
single string by selecting at each step the item with the smaller key of the 
two next available from the input files. More generally, if the inputs each 
contain n strings, and if the foregoing process is generalized to produce the 
longest possible output strings, each output string will contain precisely 
one string from each input. If the inputs contain n 1 and n 2 strings, re¬ 
spectively, then n = max (n l9 n 2 ) strings are produced in the output. A 
subsequent simple two-way classification which assigns \n -F 2\ strings to 
the first file and the remaining strings to the second, yields the greatest 
possible reduction in the maximum number of strings in any one file. 
Repetition of the merge and classification phases eventually produces an 
ordered file. 

The generalization of the process to an m-way merge and m-way classi¬ 
fication is immediate,* the optimum number of strings assigned to each 
output file by the classification process being limited to \n 4* m\. Referring 
to Fig. 6.2, the w-way merge may be described as follows: those item 
positions containing keys which equal or exceed the key last recorded on 
the single output file are said to be eligible , and the next item chosen for 
recording is the eligible item with the minimum key. When no eligible 
items remain, all positions are again made eligible and the process 
continues, initiating another string in the output file. The number of 
output strings produced is clearly the maximum of the number occurring 
in an input file. Figure 6.3 illustrates the process for m — 3. The vertical 
strokes in the figure indicate the division into maximal strings and do not 
denote information represented directly within the files. 

In any sorting procedure, the smallest subprocess which treats the entire 
set of items once is called a phase. The smallest subprocess which by 
simple iteration produces the sorting process is called a stage. A stage 
may comprise one or more phases. In the merge sort described above, for 
example, the classification phase and the subsequent merge phase together 
constitute a stage which is iterated until order is achieved; the process 
is therefore called a two-phase merge. The use of a revision operation 
(Table 6.1) permits the classification and the merge to be coalesced into a 
single phase, and the resulting process is called a single-phase merge. The 
single-phase merge requires m input files and m output files, whereas the 
two-phase merge requires only (m + 1) files—one input and m outputs 
in the classification phase, and m inputs and one output in the merge 
phase. 

The following format will be assumed for the original files in all programs 
in this chapter: the terminal item is a dummy (null) which is not to be 

* This method, commonly credited to Goldstine and von Neumann (1948), was 
presented by J. W. Mauchly in July 1946 in the Moore School lectures (1948). 
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sorted and which is accompanied by a terminal partition X m , where j is 
the index origin in use; all other items are separated by the partition X,-. 
The final dummy item facilitates the use of the immediate branching 
convention introduced in Sec. 1.22. 

Two-phase merge. The (m + 1) files used will be labeled 0 2 \ . . ., O w \ 
and Oj 2 , the last serving as initial input and as input during each classi¬ 
fication phase. Program 6.4 describes the entire merge. 

Program 6.4. The merge phase is shown on the left and the classification phase 
on the right. The heart of the former is the loop 7-17. Its operation is controlled 
by the logical vectors v and z (of dimension m ), which specify the set of exhausted 
files and the set of ineligible items, respectively. An eligible item is one which has 
been read to one of the m item positions p iy is not a dummy, (i.e., null), and 
possesses a key which equals or exceeds r , the key of the item last recorded. 

The merge phase terminates on step 7, when all files are exhausted. Step 8 
initializes the vector z (ineligible item positions) to the value v (exhausted files), 
and step 9 increments the output string count s. Each execution of the subloop 
10-14 records on the item pj having the smallest key of all eligible items. 
Steps 10-12 select and record the item pj and preserve its key as the variable r. 
Step 13 reads the next item from file j. If this exhausts the file, the branch to 
step 15 is followed. Step 15 adds j to the set of exhausted files and step 16 adds it 
(since the new item pj is a final dummy) to the set of ineligible items. Step 17 
then repeats the subloop if any eligible items remain and otherwise re-enters the 
major loop at step 7. If the files are all exhausted, step 7 branches to step 18 to 
begin the classification. If not, the production of a new maximal string is begun 
on step 8. 

If step 13 does not exhaust the file, it is followed by the decision on step 14, 
which repeats the subloop if r does not exceed the new key, and otherwise adds j 
to the set of ineligible items on step 16. 

The necessary initialization is performed by steps 3-6. Step 3 rewinds all input 
and output files. Steps 4 and 5 perform the initial read from each unexhausted 
file to the corresponding item position and respecify v to indicate any file which 
may be exhausted by the initial read itself. The vector v is itself specified external 
to the process, so that the initial set of input files may be restricted at will. On 
subsequent repetitions of the merge, all files are made available by step 2. 

The classification phase begins by terminating the output file with a dummy 
item and final partition X 2 , and rewinding it. The m input files are also rewound 
to serve as output files in the subsequent m- way classification, and the process is 
terminated by step 20 if the output string count is equal to one. Step 21 redefines s 
as the maximum number of strings to be allotted to the output files in turn, and 
step 22 reads the first item from the input file O x 2 . 

The output files are selected in ascending order by the index /. The variable /, 
which counts the strings, is initialized on step 25 and decremented on step 30 each 
time a break is indicated by the preceding comparison on step 29. When 5 strings 
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<D X 2 Output of merge; input to 

classification. 

p Item positions 

q File partitions (v=m. 

Legend 

v Exhausted files 

z Ineligible items 

s String count. 

r Key of last recorded item. 

° Final dummy item. 

i Index of output file. 

j Index of input file. 


Program 6.4 m -way merge sort 
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have been recorded, the branch from step 30 to step 24 is followed to repeat the 
process for the next output file. 

The classification phase is terminated by the occurrence of the partition X 2 on 
step 28. Step 1 then records a dummy and a final partition on each output file 
(including any which may have received no items), and step 2 resets the vector v 
to zero. 

The merge phase with which Program 6.4 begins is essential, even though 
the original data are contained in a single file, since it also serves to 
determine the string count s needed in the subsequent classification. The 
need for the string count can be avoided by using a classification process 
which merely assigns successive strings to successive files in cyclic sequence. 
This process does not satisfy the definition of a simple classification, and 
it will be given the name string classification. String classification is 
frequently more convenient to use than simple classification, particularly 
in processes such as the single-phase merge to be described next. Two 
successive strings assigned to a given file in string classification can coa¬ 
lesce to form a single string, but the probability of such an occurrence is 
small, especially in later stages. 

Single-phase merge. The two phases of the merge sort can be coalesced in 
a single revision operation employing m input and m output files. The 
two rows of files O 1 and O 2 serve alternately as input and output on 
successive stages. 

Program 6.5. The main subprocess (15-22) differs from the corresponding seg¬ 
ment (10-17) of Program 6.4 only in the control of file selection, the alternation 
of input and output being controlled by the alternator a, which alternates between 
1 and 2 (step 2) on successive stages. The classification is controlled by the variable 
/, which selects the particular output file on step 17 and which is itself cycled 
through the integers 1 to m by step 14. When all files are exhausted, the branch 
to step 2 is followed, resetting the vector v to zero, terminating the output files, 
and rewinding all files. The final output is contained in file Oj®. 

Elimination of file rewind. Each of the sorting processes described requires 
a rewind of all files between successive stages. If the files employed are 
capable of backward read, the processes can be modified so as to eliminate 
the need for rewind. Since each file is alternately read and recorded in 
successive stages, each will always be read in a fixed direction and recorded 
in the other. Since the space needed for recording is not known in advance, 
it is necessary to do all recording in the forward direction and therefore 
to read in the backward direction. The changes required will be illustrated 
by modifying the single-phase merge of Program 6.5. 

Program 6.6. Since the alternate forward record and backward read effectively 
reverse the order of all files on successive stages, alternate stages must assemble 
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o 1 

Input to odd stage 1 


o 2 

Output of odd stag 


p 

Item positions 

-v = m. 

q 

File partitions 


V 

Exhausted files 


z 

Ineligible items 


s 

String count. 


r 

Key of last recorded item. 

a 

Alternator (1, 2). 



Legend 


Program 6.5 m -way single phase merge sort 


ascending strings and descending strings, respectively. This is achieved by re¬ 
versing the algebraic sign of the key in statements 20 and 24 (by use of the alter¬ 
nator a) on even-numbered stages. 

Except when the entire process is terminated by failure to follow the branch 
from step 8 to step 11, the output files are never terminated by a partition, nor 
rewound, but are simply read backward as in statements 13 and 23. To ensure 
that the backward read of a file terminates properly, each is provided with an 
initial dummy item (step 11), and the branch on step 23 occurs on either the 
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Program 6.6 Single-phase merge without rewind 
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terminal partition X 2 or the permanent initial partition X. The entire process is 
terminated by equality of s and a — 1 on step 8, which can occur only if s = 1 
and a = 2. This pair of conditions ensures not only that the number of strings is 
unity but also that an odd number of stages has been executed, and hence that the 
final output (on O x 2 ) is in ascending order. 

Since the initial input file must normally be read forward, an exception is made 
through the agency of the variable c. On the first pass only, c = 0, and the 
reads on steps 13 and 23 are therefore forward. The forward read on the first 
pass necessitates a subsequent rewind of the input files, which is provided by 
step 6. 

A commonly used variant of the m-way merge sort (called string¬ 
doubling from its behavior in the case m = 2) treats the initial input as if 
the maximal strings contained were each of length one, and therefore 
produces strings of uniform length m in the output. These uniform strings 
may not be maximal but are treated on the next stage as if they were, i.e., 
output strings of length m 2 are produced. In general, the fcth stage pro¬ 
duces strings of uniform length m k , and flog m n\ stages are required to 
order n items. The number of stages does not depend on the initial number 
of maximal strings, and no use is made of possible inherent order in the 
original array. The comparison operations may, however, be somewhat 
simplified, since the need to test eligibility is replaced by counts of the 
items read from each file or by recording partitions between successive 
strings. If n is not an integral power of m, some of the strings will be 
shorter than the normal length. They can be expanded by dummy items, 
although the use of partitions renders this unnecessary. 

Classification and simple merge 

The classification and simple merge sort is also referred to as a column 
or digital sort, for the successive classification phases are controlled by 
successive columns (digit positions) of a positional representation of the 
key. The behavior of the process is not so obvious as the behavior of the 
merge of the previous section, and a formal proof of its ability to produce 
ordering will be given at the end of this section. Since the process is based 
on a positional representation of the key, it will be convenient to use 
0-origin indexing for all operands. 

Let k be the (nonnegative) sorting key, let d be its digital representation 
in a base b number system (that is, (be) d = k), and let q = v(d ), 
where b q exceeds the largest existing key. The complete column sort 
comprises q stages, each stage consisting of a 6-way classification 
followed by a 6-way simple merge. The classification on stage j is based on 
d v _j (theyth digit of the key counting from the low-order end), and each 
item is assigned to file O t , where i = d v _ jm The simple merge is defined 
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such that the output file has the form O 0 , 0 1? . . . , where the are 
the b output files of the preceding classification. 

Two-phase column sort. Like the merge sort, the base b column sort may 
either be two-phase (classification followed by simple merge and using 
b + 1 files) or single-phase (using 2b files); the two methods are described 
by Programs 6.7 and 6.8, respectively. 



0-origin indexing 

®0° 

Input to classification; output 
of merge. 

O 1 

Output of classification; input 
to merge (vf® 1 ) = b). 

d 

Positional representation of 
key (v(d) = q). 

b 

Base of representation. 

j 

Stage = q 

i 

Current input file. 


Program 6.7 Two-phase column sort 

Program 6.7. Steps 1-7 constitute the classification and steps 8-14 the merge 
phase. File O 0 ° is the single input to the classification and the single output of the 
simple merge. The remaining b files are denoted by 0 4, i e i°(b). 

The component of d which controls the current classification (step 6) is selected 
byy, which scans d from right to left. In the (implicit) specification of d on step 5, 
b serves as an auxiliary variable (cf. Sec. 1.19). When the classification phase ends 
with the exhaustion of O 0 ° (step 4), the output files are terminated, all files are 
rewound, and the simple merge is performed (steps 10-13) to re-collect on O 0 ° the 
results of the preceding classification. This file is then terminated by step 14, and 
the entire process is repeated from step 2 for the next lower value of j. The 
dimension of d is specified (by compatibility with e(q) on step 5) as q , and q is, 
of course, specified externally. 
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Program 6.8. The alternator a again determines the roles of the two sets of files 
(input or output) on successive stages, but because of the use of 0-origin indexing, 
it alternates between the values 0 and 1. The classification process (steps 7-11) 
differs from that of Program 6.7 only in the control exercised by the alternator a, 
and in the specification of a sequence of input files ® 0 a , Oj®,..., instead of 
the single file ® 0 °. 



Program 6.8 Single-phase column sort (output on b files) 


The program is deficient in two respects: it requires that the original input be 
on a full set of b files, and it produces a final output on b files rather than one. 
The first defect may be remedied by the use of an externally specified logical 
vector v to designate the input files which are unused in the original stage. The 
second may be remedied by a final simple merge following the q\h stage. This 
could be added as a separate program, but it can be effected more simply by 
replacing q with q - h 1 to ensure that the high order digit d 0 is identically zero and 
so restricts the final output to a single file. 

Validity of the column sort. * If d is the q- digit, base b representation of the 
key k of an item p , then the (q — y)th pass of the base b column sort assigns 
the item to file O f °, where i = d jm The subsequent simple merge collects 
the files in the order O 0 a , Oj®, . . . , Consider any two items p x and p z 

with distinct keys k ± and k 2 represented by d 1 and d 2 , respectively. It may 


This proof is due to Ashenhurst (1953). 
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be assumed, without loss of generality, that k ± < k 2 . Since the keys are 
distinct, there exists an integer r such that d r l < d r 2 and that a r \d x = 
a T /d 2 . Thus r is the (0-origin) index of the highest order column in which 
the keys differ. 

In the (n — r)th pass, the items p x and p 2 are therefore assigned to files 
and O/* such that h < i. Hence at the conclusion of the subsequent 
merge, item p 1 precedes item p 2 . In each subsequent stage the two items 
are always assigned to the same file (since d s x = d s 2 , s < r ), and the two 
items are therefore always relatable with respect to these operations. 
Since each stage is a copy operation, and therefore rank-preserving, the 
relative ordering of all relatable items is preserved, and p ± thus precedes p 2 
at the conclusion of the process. Thus for any pair of items p x and p 2 with 
keys k x and k 2 , such that k 1 < k 2 , item p x precedes p 2 in the final arrange¬ 
ment. 

Vector keys and categories . The column sort is actually based on the 
components of the vector d and only indirectly on the numerical key it 
represents. The process can therefore be generalized to any numerical 
vector key y defined on each item a t . It can be further extended to an 
arbitrary key vector b belonging to the product set 

C = c° ® c 1 ® • • • ® c ^" 1 

as follows: on the classification stage based on component b j9 the item is 
assigned to file c j t b jt An obvious generalization of the foregoing proof 
of validity shows that the resulting ordering is that of the product set C. 

The component set c j is called the jth category , c° is called the major 
category , c v{b) ~ l the minor category , and the ordering is said to be defined 
on category c v{b) ~ l within c v(&) “ 2 . . . within c°. For example, a nine- 
column employee number b = ( b 0 , b l9 b 2 , b 3 ) may be based on four 
categories, the first component b 0 representing the employee’s one-bit 
payroll classification (hourly or salaried), the second his two-decimal¬ 
digit department number, the third his two-alphabetic-character job code, 
and the fourth his four-decimal-digit identification number. The column 
sort on the base b representation of a numerical key is clearly a special case 
of a vector key in which each of the categories is the set 

It is frequently necessary to order a set of items on certain subsets of the 
given categories, and on different rankings of the categories. In general, if 
b is a vector key and m is any compatible mapping vector, then a related 
ordering may be defined on the vector d = b m . If—continuing the previous 
example—it is required to produce a list ordered by employee’s department 
within job classification, then m = (2, 1) and d = ( b 2 , bj). 

Any infix of b defines a related ordering which is actually achieved at 
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some stage of the column sort on b , and the ordering defined by it is there¬ 
fore said to be contained in the ordering b. If the orderings d° , d 1 ,... 9 d r ~ x 
are all contained in b, they can all be achieved at some stage in the ordering 
b , and the total number of passes required may be less than that required 
to achieve the r different orders independently. For example, d° = ( b 2 , b 4 ), 
d 1 = ( b 0 , b 2 , bj, and d 2 = (b 2 , b l9 b 0 , b 3 , h 4 ) are all contained in / = 
(b 0 , b 2 , b l9 b 0 , h 3 , h 4 ) and can be achieved jointly by sorting on the six 
components of / rather than on the ten components of d°, d l 9 and d 2 . 

The usefulness of this result is further extended [Ashenhurst (1953)] by 
the following fact: if a given component recurs in an ordering vector, its 
later occurrences may be ignored. For example, the second occurrence of 
b 0 in the vector / (as / 3 ) may be ignored, and / therefore contains the 
ordering d 3 = (b 0 , b 2 , b l9 b 3 , b 4 ) as well. 

The propriety of suppressing later recurrences of a component of an 
ordering vector is easily established. Let b be a given ordering vector and 
let d = b m and c = b n be two related orderings. Moreover, let d be 
obtainable from c by suppressing all later recurrences of components of b. 
This implies that the mapping vector m is obtained from the mapping 
vector n by the same process, i.e., m = (<r/n)/n. If p x and p 2 are two 
items whose (distinct) keys c 1 and c 2 agree in all components up to but not 
including theyth, then their relative order is determined by the component 
c j = b Uj . In the corresponding keys d 1 and d 2 it is clear that the first 
component in which the items differ is again b n and that the same relative 
order is therefore determined by d. Since a third ordering v — b p is also 
equivalent to the ordering c = b n if p is also reducible to m, the result 
concerning equivalent orderings can be extended as follows: two orderings 
y = z m and w — z n are equivalent if (cr/m)/m = (o/n)/n. 

Choice of number base. In the merge process of Program 6.4, the value of 
m may be chosen to suit the number n of files available, that is, m = n — 1. 
In the column sort, on the other hand, the number of files required is 
determined directly by b , the base of the number system representing the 
key. If the choice of b is otherwise arbitrary, it can be chosen as n — 1 for 
any n > 2. However, explicit execution of the base conversion indicated 
by the statement 

(be)±d<-k(p) 

is usually avoided by using the base in which the key is represented in the 
original files. If a base conversion is required, it can be performed once on 
the first stage and the resulting vector d incorporated with the item for use 
on subsequent stages. 

Base conversion may sometimes be inconvenient or impossible as, for 
example, in the case of special purpose sorting equipment devoid of 
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arithmetic units. In this event the stage required for each digit (6-way 
classification and subsequent merge) may be executed as a series of copy 
operations each utilizing fewer than (6+1) files. If, for instance, the 
digits are represented in a ranked binary rc-bit code, a series of n two-way 
classifications and two-way simple merges will effect the desired ordering 
on one digit of the key. More generally, a group of p successive binary 
digits can be treated at each pass, providing that 2 P + 1 does not exceed 
the number of files available. 

Repeated block sort. If a set of items is classified on the high-order 
column of an associated vector key, each of the resulting subvectors can be 
separately ordered and then merged in a final simple merge. Thus each of 
the subvectors forms an infix or block in the final arrangement. Such 
block sorting can be used to distribute the labor among a number of 
independent sorters. 

Block sorting can also be repeated by further classifying each block on 
the next lower-order column of the key. For a key vector of dimension q , 
q repetitions of the block sort yield a complete classification, and ordering 
can then be achieved by a subsequent simple merge. However, since the 
number of blocks produced is (except for duplicate keys) equal to the 
original number of items, the use of repeated block sorting is unattractive 
unless simplified by special schemes for keeping record of and controlling 
the numerous blocks. Two such schemes will be discussed. The radix 
exchange sort (Sec. 6.4) is appropriate to random-access storage only; the 
amphisbaenic sort is appropriate to serial files. 

Partial pass methods 

Each of the sorting schemes discussed thus far is constrained to treat the 
entire collection of items at each stage. Partial pass methods obtained by 
relaxing this requirement normally achieve a reduction in the total number 
of items handled, but at the cost of some increase in complexity. The 
partial pass methods gain their advantage by largely obviating explicit 
merge phases. 

The basic column sort gives rise to two partial pass methods of interest, 
the amphisbaenic sort and the partial pass column sort. The cascade sort 
arises from the use of partial passes in a merge sort. 

Partial pass column sort * This method achieves the effect of one stage of 
a column sort on a base 6 key with fewer than (6+1) files by using a 
sequence of partial passes. The method will be illustrated by an example 
[taken from Ashenhurst (1953)] involving four files and a decimal key. 

* Presented by John Mauchly in the Moore School lectures (1948) and treated more 
fully by Ashenhurst (1953). 
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Table 6.9. This table describes the partial pass column sort for reordering on a 
single column of the decimal key (theyth). The parenthetical expression follow¬ 
ing each file O z - indicates that it contains all items whoseyth key digit equals one 
of the enclosed integers. A second pair of parentheses indicates a second set of 
items in the file following, and grouped separately from, the first set. Thus the 
original input file ® 0 is described by ® 0 (0, 1, 2, 3, . . . , 9) or by any permutation 
of the decimal digits enclosed in a single pair of parentheses. 


Step 

Input File 

Output Files 

Pass 

Fraction 

Remaining from previous steps 

Copied 

1 

4> 0 : (0, 1,2, 3,4, 5, 6, 7, 8, 9) 

Ji; 

(0, 2, 4, 7) 
0,5, 6) 

(3, 8, 9) 

1.0 

2 

<Dj: (0, 2; 4, 7) 

<J> 2 : 0,5, 6) 

<F 3 : (3, 8,9) 

<*V 

(2, 7) 

(4) 

(0) 

0.4 

3 

<J> 2 : (1, 5, 6)(2, 7) 

<D 3 : (3. 8, 9X4) 

<I> 0 : (0) 

4>i : 

(5) 

0X2) 

(6) (7) 

0.5 

4 

<I> 3 : (3, 8, 9X4X5) 

<D 0 : (0)0X2) 

: (6)(7) 

(3)(4)(5) 

(8) 

(9) 

0.5 

5 

O x : (6X7X8) 

O 0 : (0)(1X2)(3)(4)(5) 

0 2 : (9) 

(6X7X8) 

0.3 

6 

<D 2 : (9) 

O 0 : (0)( 1 )(2)(3)(4)(5)(6)(7)(8) 
<I> 3 : 

(9) 

0.1 


Total 

2.8 


Table 6.9 The partial pass column sort 


The first step copies each item whose /th key digit is (0, 2, 4, 7) to file O l5 items 
(1, 5, 6) to file <D 2 , and items (3, 8, 9) to file 0 3 . After each step, only the previous 
input file and the next input file are rewound, in this case <1> 0 and <b v In step two, 
items (2, 7) are copied to <D 2 , and therefore follow the group (1,5, 6), as indicated 
by the separate parentheses. Similarly, items (4) and (0) are copied to <X> 3 and O 0 , 
respectively. 

Step three is preceded by a rewind of ® 2 and O x . Since items (1) and (2) occur 
in separate groups in the new input file ® 2 , they can be copied to O 0 in separate 
groups (1) (2) as indicated. Similar remarks apply to items (6) and (7) copied to 
Oj. The three subsequent steps complete the required ordering, producing in the 
original input file <D 0 the ordered array (0) (1). . . (9). The rightmost column 
shows that fraction of the original file (assuming a uniform distribution of the key 
digits) copied on each pass. The total at the bottom indicates that the expected 
execution time is equivalent to 2.8 full passes. 

The partial pass process of Table 6.9 is described more concisely by the 
0-origin matrix M of Table 6.10. Element M r s specifies the file to which 
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Digit 

0 

1 

0 


— 



Digit 

1 

2 


0 




Digit 

2 

1 

2 

0 




Digit 

3 

3 



0 



Digit 

4 

1 

3 


0 



Digit 

5 

2 


3 

0 



Digit 

6 

2 


1 


0 


Digit 

7 

1 

2 

1 


0 


Digit 

8 

3 



1 

0 


Digit 

9 

3 



2 


0 

Input 


0 

1 

2 

3 

1 

2 


Table 6.10 Matrix M describing the partial pass column sort of Table 6.9 

items with key digit s are to be assigned in the rth step of the process, for 
r e i°(6) and s e i°(10). The eleventh and last row of M specifies the input 
files, i.e., M/ is the index of the input file in the rth step. An algorithm 
based upon the matrix M is described by Program 6.11. 

Program 6.11. The subloop 14-18 performs the classification according to the 
element M/, using file / (specified by M r b on step 9) as input. When the file is 
exhausted, the branch to step 6 increments r and repeats the process unless 
r = v(M). Equality indicates completion of they'th column and causes a branch 
to step 4 to decrement j and reset r. The comparison on j is deferred to line 13 
so as to follow the termination and rewind of the new input file. The branch on 
step 10 prevents the recording of a dummy item on the original input file. The 
previous input file is rewound by step 8. 

Program 6.11 includes the two-phase column sort as a special case, for 
if (6 + 1) files O 0 , are available, the matrix shown on p. 194 

specifies a process essentially identical with that of Program 6.7. File 
corresponds to file ® 0 °, the first column of M determines the 6-way 
classification, and each of the b succeeding columns specifies the copying 
of one file in the 6-way simple merge onto file ® ? ,. 

The method of partial passes is frequently used in the sorting of alpha¬ 
betic data. Variants include the If pass-per-character method of sorting 





Program 6.11 Method of partial passes for m files 
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punched cards* and the procedure described by McCracken et al. (1959), 
p. 312. 

Amphisbaenic sort. The amphisbaenic sort (Nagler, 1959) is a particular 
arrangement of the repeated block sort employing partial passes. For a 
base b key it requires b + 1 files and proceeds as a sequence of classi¬ 
fications, with a simple merge of the last b subblocks produced occurring 
immediately after each classification on the low-order column. Each 
classification allots digits to the available output files according to the rank 
of the file index, e.g., if file O 0 is the input, the digits 0, 1, 2, . . . , b — 1 are 
allotted to files 1, 2, 3, . . . , 6, and if file 0 3 is the input, they are allotted 
to files 0, 1, 2, 4, 5,. . . , b. The files are recorded forward and (except for 
the initial input) are read backward without intervening rewind. The 
subblocks are designated by the key digits on which they have been 
classified, e.g., by 0, 1,2,. . . , b — 1, 00, 01, 02, etc. The block chosen 
for classification at each step is the one with the smallest designation 
among those not yet merged, the designations being ranked according to 
their values as decimal (6-ary) fractions. Thus block 213 precedes block 
214, which precedes block 22. 

Table 6.12 shows the steps of an amphisbaenic sort on a three-digit, 
base three key using files O 0 , O l5 0 2 , and 0 3 , with the initial and final data 
on file O 0 . The input files are designated by asterisks. Thus file O 0 is the 
input to step 1, and blocks 0, 1,2, are classified to files 0 1? 0 2 , and 0 3 as 
indicated. Step 2 classifies the smallest block (0) from file to files O 0 , 0 2 , 
and <F 3 . Step 3 classifies blocks 000, 001, and 002 to files 0 1? 0 2 , and 0 3 . 
The next step merges these blocks to file O 0 , and the following step begins 
classification of the next smallest block (01) on file 0 2 . It is clear that if 
the files are read backward, the next block to be classified is always 
immediately available. The general process for a ^-digit, base b key is 
described by Program 6.13. 

Program 6.13. 0-origin indexing is used throughout. Each classification is con¬ 
trolled by the “current” vector c (of dimension 6), whose components are the 
successive indices of the available output files. Thus c k is the index of the file to 
which digit k is classified (steps 16-19). The current vector is determined by 
step 10 so as to omit the index / of the current input file. The selection of the 
block to be classified is determined by the vector h (of dimension q ), the next 
block to be classified on digit j being determined by the prefix a i+1 /h. Because the 
classifications proceed for increasing values of j and the blocks just produced 
appear last on the files, the selection can be determined by the last component of 
the prefix alone, i.e., by h jm This is done on step 7, where the index i of the input 


Described in IBM Form 22-3177-2 Sorter Manual , p. 12. 



000-002 000 - 002,010 000-012 00 - 012,020 000-022 000 - 022 , 10 000 - 022 * 000-102 



Table 6.12 Amphisbaenic sort on 3-digit base 3 key 




O-origin indexing 



Files (/ = 0 for initial 

c 

Indices of current out- 


and final data); 


put files. v(c) = b. 


i e i°(b + 1). 

h 

a j+1 /h specifies input ^ 


b 

Base of representation. 


block to classification 


j 

dj controls current 


on digit j. 



classification. 

s 

Sj is index of input 


i 

Index of current input. 


file to classification on 

V — (j 

f 

Controls forward read 


digit j. 


and rewind of initial 

d 

Positional representa¬ 



input. 


tion of key. 



Legend 



20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 






198 Sorting §6.1 

to the preceding stage determines the vector €*/i °(b) of output indices, from which 
the new input index is selected as the h 5 th component. The successive values of h, 
/, y, s, and c listed in the example of Table 6.12 may be helpful in tracing the 
operation of the program. 

When classification on the last digit is completed, the last subbiocks are merged 
(steps 25-30) onto the output file ® 0 . The variable j must then be decremented 
and the component hj incremented modulo b (steps 31-32). When h } completes 
a cycle (becomes zero), the corresponding subbiocks are exhausted and j must be 
decremented repeatedly until the corresponding hj does not become zero. The 
main process through increasing values of j is then repeated. 

Although hj determines the input subblock for the classification on digit j, it 
determines the index of the input file only indirectly (step 7) through the vector 
of output files, itself determined by the input file / used in the classification on the 
preceding digit. When / is increasing, the value of / is simply the value from the 
preceding stage. However, when j is decremented (steps 31-32), it is necessary to 
determine the input / used in an earlier classification. A record of the value of / 
corresponding to each classification j is therefore kept (step 9) in the vector s , and 
is used to redefine / on step 33. 

Each file to be used as output (except possibly file ® 0 ) is first closed with a 
partition (steps 12-15) to demark the beginning of each of the subblocks to be 
recorded. Since backward read is to be used, two dummy items are provided so 
that the branch on \ coincides with the read of a dummy item. An extra read 
then disposes of the extra dummy. Any partition recorded on the final output 
file ® 0 before a fully classified subblock (when j = q — 1) would remain in the 
final output. Step 11 prevents this by initializing the index A: to 1 if/ — q — 1 and 
the index of the first output file (that is, c 0 ) is zero. In a similar manner, step 25 
prevents the attempt to copy file ® 0 to itself during the merge phase (steps 25-30). 
Step 28 reads the extra dummy item recorded in the partitioning operation of 
steps 12-15. 

Ail files are read backward except the initial input in the first stage. This 
behavior is controlled by the logical variable f (steps 16 and 20-21). The branch 
on step 20 fails the first time only, allowing f to be respecified as 1 and rewinding 
file ® 0 . On subsequent stages, the branch to step 23 causes the read of the extra 
dummy partition. 

Cascade sort* The cascade sort is a partial pass merge sort, with each 
stage proceeding as follows. The strings are initially distributed (unequally) 
among m of the m + 1 available files; an m- way merge to the empty file is 
performed until some input is exhausted; an m — 1 way merge to the 
newly emptied file is then performed from the remaining inputs, and so on 
to a final two-way merge. The effectiveness of the process depends on a 
particular initial distribution of the input strings. 

Table 6.14 illustrates the process for 190 strings distributed among five 
of six available files as shown in the first row. Succeeding rows show the 


Due to Betz and Carter (1959). See also Gilstad (1960) and Carter (1962). 
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Stage 

Distribution of strings i 

1 

0 

15 

29 

41 

50 

55 

1 

5 

9 

12 

14 

15 

2 

1 

2 

3 

4 

5 

3 

1 

1 

1 

1 

1 

4 

0 

0 

0 

0 

1 


Table 6.14 Cascade merge sort 


distribution of strings at the end of the succeeding stages.* The process 
requires but four passes, only the last of which is a complete pass (e.g., the 
last five strings need not be copied in stage 1). 

The power of a merge process may be defined as the (average) factor by 
which the number of strings decreases per pass, i.e., as f/s, where s is the 
number of strings whose ordering can be completed in p passes. For the 
given example, the power is approximately 3.7. This surpasses the power 
of 3.0 attainable in a three-way single phase merge sort using the same 
number of files. 

Gilstad (1960) has proposed a variant of the cascade sort (called 
polyphase) in which every phase is an w-way merge, i.e., each newly 
recorded output enters immediately as input in the following phase. Its 
power is slightly greater than that of the cascade sort. 

6.2 EVALUATION OF SERIAL SORTING METHODS 

Three major factors enter the evaluation of a serial sorting process: 
the amount of program storage required, the number of serial-access files 
used, and the execution time. The first two factors require little analysis, 
and attention will be limited primarily to the third. 

Because the execution time of a serial sorting process is normally 
determined almost completely by the time required to transfer information 
to and from the serial files, the execution time is assumed to be directly 
proportional to the number of passes of the files required. Each phase 
corresponds to a pass or (as in partial pass methods) to some fraction of a 
pass, and the number of passes per stage is determined by summation over 
the component phases. 

The constant of proportionality relating actual execution time and 
number of passes depends on such factors as the average length of the 
items, the reading and recording rate of the serial files, and (in processes 

* The /th column of the table refers not to a specific file but to that file which 
ranks yth in number of strings. 
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requiring rewinding after each phase) the rewinding speed. Since these 
factors are specific to particular equipment and particular tasks, and since 
the nature of the dependence is obvious, the present analysis is limited to 
consideration of the number of passes. 

Consideration is also given to related orderings (cf. vector keys and 
categories), which may, in the use of column sorting, be achieved more 
efficiently jointly than separately. 

Simple classification and merge 

The number of stages required in the m-way merge sorting methods 
depends on s 0 , the number of maximal strings in the original file, and on 
m. For, if Sj is the number of (maximal) strings at the conclusion of the 
yth stage, then Sj = f s s -4- m\ Since the eventual string length must be 
one, the number of stages required is given by 

r = flog™ U 

for which the approximation r == log w .s 0 will be used. The number of 
passes is then 2 r for a two-phase merge and r for a single-phase.merge. If 
the “cost” of the process is assumed to be of the form 

c = (a + m)r = (a + m) log„, s 0 , 

for some constant a , then the optimum choice of m is obtained as the 
solution of the equation 

log e m = 1 + a -4- m. 

The two-phase and single-phase methods may be compared for a fixed 
number of files as follows. Let n = 2k be the number of files,* let s be the 
number of strings, and let t x and / 2 be the execution times for the single¬ 
phase and two-phase methods, respectively. Then 

h = 2 log,,,,, s = 2 lo&ja-p s 
h log, ni s iog*s 

__ 2 log s k _ 2 log 10 k _ 2 log, k 

log s (2k — 1) log 10 (2k - 1) log, (2k — 1) 

This ratio increases monotonically from 1.26 at k = 2 to an asymptotic 
value of two. Since it exceeds unity, the single-phase process is to be 
preferred. 

Expected number of strings. Since the original file is frequently specified in 
terms of the number of items n rather than the number of maximal strings 

* An odd number n would prejudice the result against the single-phase method, since 
one file would necessarily be left idle. 
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s , it is desirable to determine the relation between n and the expected value 
of s for a random distribution of the keys. 

Let k be a vector of dimension n + 1 whose first n components are the 
successive keys of the file and whose last component is infinite, and let 
f=(k< Ik ), d = (k = Ik), and b = (k > [k). 

If, for example, 

k = 7 93558214 x, 

then 

[k = x793558214 
/ = 1010001100 
d = 0000100000 
b = 0101010011. 

Each unit component of / marks the beginning of each maximal 
ascending string in the forward direction (left to right), each unit com¬ 
ponent of | b marks the beginning of each maximal string in the backward 
direction, and a unit component of d marks each key which duplicates its 
predecessor. Consequently, +//, +/b, and +jd are the number of 
forward strings, backward strings, and duplicates, respectively. Since the 
relations <, = , and > are exhaustive and mutually disjoint, the three 
logical vectors are exhaustive and mutually disjoint, i.e.,/ v d v b = €, 
and f A d = f A b — d A b = 0. 

Consequently 

(+//) + (+/*>) + (+/<*) = v(k) = n + 1. 

Denoting the expected value of x by e(x ), it follows that 

e( + lf) + e(+/b) + e ( + /d) = n+ 1. 

Symmetry and the assumed random distribution together imply that 
e(+/f ) = e(+/b). Consequently, the expected number of strings e(s) is 
given by 

eO) = e(+lf) = (n+ 1 - e( + /d)) 2. 

If there are no duplicates, the expected number of strings is approxi¬ 
mately n/2, and the expected string length is therefore approximately two. 
Assuming a uniform distribution of keys in a range of g values, the proba¬ 
bility that d t = 1 is clearly 1 /g for all / e i 2 (n — 1). Therefore e(+/d) = 
(« - 1) H- g. 

Classification and simple merge 

The number of stages required in the column sort is equal to q , the 
number of significant digits in the representation of the key. To facilitate 
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comparison between different number bases, it is convenient to use the 
variable g denoting the range of the key. Thus in any base b , the number 
of significant digits q required in the key is given by q = [log^g], or 
approximately by q — log b g. For any pair of bases b x and b 2 , the corre¬ 
sponding number of stages q 1 and q 2 are related as follows: 

<7, = logftt g = log ff = log,,, b 2 = j ^ 

<7 2 log„ 2 g log ,,b 1 log H bi - 

A more practical form for calculation is 

_ l°gio ^2 
q 2 log 10 b 1 

The foregoing expressions are identical in form to those obtained for the 
number of stages required in the merge sort, but with the range g replacing 
the initial number of strings s 0 , and with b replacing m. Moreover, the 
number of files required depends on b in the same way that the corre¬ 
sponding merge processes depend on m. This holds for both the two- 
phase column sort (b + 1 files) and the single-phase column sort (2b files). 
The analysis concerning the optimal value of m therefore carries over 
directly to the choice of the base b , the only additional consideration being 
the possible need for base conversions on the key. The comparison 
between two-phase and single-phase processes also applies directly to the 
column sort, with the conclusion that the single-phase method is superior. 

Of the methods discussed, the column sort is the only one which shows 
significant advantages in the joint treatment of two or more related 
orderings. If x = z p and y = z q are two vector keys, and if cw'/p = a i /q, 
then the ordering defined by the key z r , for r = Uy /p will, when applied 
to the set of items ordered on y, suffice to produce ordering on x. The 
total number of columns sorted to achieve the two orderings jointly is then 
reduced by /. 

More generally, if c o'/p agrees with a selected subvector of the prefix 
a j /q and if the remaining elements of the prefix occur in c o ? /p (in any order), 
then sorting on the colurons corresponding to co*/ p may again be elided. 
More precisely, if there exist integers / and y, and a logical vector u such 
that 

w'/p = u/(a’/q), 

and (37P) 2 (“/( a V<7))> 

then the ordering x can be achieved by applying the ordering c o'/p to the 
ordering y, and the total number of columns sorted is reduced by /. It is 
assumed that neither p nor q contains any repeated components, for if 
they do, each of the later occurrences may be suppressed. 
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Since the ordering on x can be performed before rather than after the 
ordering on y, the roles of p and q may also be reversed, and the case 
showing the larger reduction may be chosen. If three or more orderings 
are prescribed, the foregoing method may be applied to evaluate each of 
the possible sequences of ordering. 

Partial pass methods 

In the absence of a general method for designing a partial pass column 
sort, its efficiency will be indicated only by an analysis of the four-file 
decimal key example of Table 6.9. If g is the range of the key, the number 
of passes is given by p x = 2.8 q = 2.8 log 10 g. This may be compared with 
the valuep 3 = 2 log 3 g obtained for the straightforward base three column 
sort, which can also be performed with four files. The ratio 

77 = (2.8 log 10 g) 4- (2 log 3 g)= 1.4 log 10 3 = 0.668, 

P 3 

indicates the superior efficiency of the partial pass column sort for this case. 
The four files can also be used for a single-phase column sort in a base 
two number system, yielding the value p 2 = log 2 # f° r the number of 
passes. Hence pjp 2 = 0.843, and the method of partial passes is again the 
more efficient. 

In the amphisbaenic sort on n items with a ^-digit base b key, b -f 1 files 
are required, and the total number of items handled in the classification 
phases is nq. In the merge phases, however, each item is handled at most* 
once, and the total number of passes is therefore less than (q + 1). This 
may be compared with the 2 q passes required in a two-phase column sort 
using the same number of files. Alternatively, conversion of the key to a 
base [(b -f 1 ) H- 2 ] representation could permit the use of a more efficient 
single-phase merge requiring approximately q log (M1)/2 b passes. 

The disadvantages of the amphisbaenic sort reside in the more complex 
program required and in the need for frequent reversal of the direction of 
the files, i.e., from forward record to backward read. The time lost in such 
reversal may be considerable for certain files. 

The power of the cascade sort is, as indicated by the example of Table 6.14, 
somewhat greater than that of the corresponding merge sort. Its behavior 
is most conveniently analyzed [in the manner developed by Carter (1962)] 
in terms of the difference equation satisfied by the number of strings 
occurring in successive states. The formulation of these equations is 
indicated in Exercise 6.19. Carter provides asymptotic solutions for cases 
of practical interest. 

* If the final output file <t> 0 is among the set of output files in the classification on the 
low-order digit, the subblock assigned to it need not be recopied in the merge phase. 
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In addition to its greater complexity, the cascade sort suffers from the 
need for a particular initial distribution of the strings, and from the 
dispersal of the file rewinds (which can be performed concurrently by most 
computers) throughout the process. Moreover, in the event of a computer 
error, a rerun from the last correct input files is much more difficult to 
program than is a corresponding rerun for a straightforward merge. 

6.3 AIDS TO SERIAL SORTING PROCESSES 

Internal sorting normally enjoys a much higher basic execution rate than 
does serial sorting, but for large volume files the limited size of internal 
storage may make serial sorting necessary. The amount of serial sorting 
may, however, be reduced by some use of internal sorting. For example, a 
preliminary internal sort can produce maximal strings whose lengths are 
limited only by the size of the internal store and thus reduce the number of 
strings presented for subsequent serial merge sorting. 

The present section is devoted to methods of reducing serial sorting by 
the auxiliary use of internal sorting. For this discussion, the only knowl¬ 
edge assumed concerning the internal sorting process is its capacity to 
order a specified number of items. 

Two classes of processes arise, one for aiding merge sorting and one for 
aiding column sorting. The aid to merge sorting is the simpler, since it 
consists merely in assembling long strings by internal sorting before 
beginning the serial merge sort. A serial column sort, on the other hand, 
may be aided by a final internal sort performed after the column sort. 

If k — k(x) is a key vector associated with x , and if m is any positive 
integer, then a serial column sort performed on the key vector 

k 1 = [k -T- me J 

produces the vector x° @ x 1 @ • • • © x p , where the vector x* contains all 
items such that kf = i. If the infix vectors x' are then copied in turn from 
the file, individually reordered on the key 

k 2 = (me)| 0 fe 

and recorded, the resulting file will be ordered on k. Table 6.15 shows an 
example for m — 4. 

If internal storage allows the internal ordering of as many as n items, 
then the reordering of the infixes x i can be accomplished by an internal 
sort provided that m is so chosen that v(x ') < n for all /. If the original 
keys are all distinct, m may be chosen equal to n. 

If the sort on k 1 is performed as a base b serial column sort, the number 
of stages required is reduced by approximately log b m from the number 
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Original Order 
* k k 1 k 2 


Ordered on k 1 
x k k 1 k 2 


Sets of Dupli¬ 
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Final Order 
x k k 1 k 2 


*! 7 1 3 
* 2 9 2 1 
* 3 2 0 2 
11 2 3 
* 5 6 1 2 
* 6 4 1 0 
* 7 3 0 3 


2 0 2 
# 7 3 0 3 
*! 7 1 3 
6 12 
*6 4 1 0 

# 2 9 2 1 

*4 11 2 3 


x 1 


*3 2 0 2 
# 7 3 0 3 
* 6 4 1 0 
# 5 6 12 
x ^ 7 13 
* 2 9 2 1 

* 4 11 2 3 


Table 6.15 Internal aid to column sort (w = 4) 


required for a corresponding sort on the original key. The subsequent 
internal sort on k 2 therefore serves as an aid to the serial column sort. 

The arithmetic operations indicated in the definition of keys k l and k 2 
may be simplified if m is chosen as an integral power of the base b of the 
original key. For, if the vector d is the ^-digit base b representation of k h 
and if m = b\ thenco'/d, and io^d are the base b representations of k t x and 
kf, respectively. The keys are therefore obtained from k by extracting the 
specified columns, and the serial sorting is reduced by exactly t stages. 


6.4 INTERNAL SORTING METHODS 

Since the range of practical sorting methods is clearly broadened by the 
use of random-access storage, internal sorting methods include all of the 
serial processes treated in Sec. 6.1. However, since the use of random- 
access storage introduces certain new problems in the execution of these 
processes, they will be reconsidered before proceeding to methods suited to 
random-access storage only. 

If the available random-access storage is divided into a number of areas 
or fields , these fields can be used in lieu of the serial files. The serial 
sorting methods then carry over unchanged except that the automatic 
self-indexing property of the serial files must be replaced by an explicitly 
programmed indexing of the corresponding fields. 

The efficacy of an internal sorting process depends not only on the speed 
of execution but also on the number n of items which can be ordered, using 
a given storage capacity c (measured in number of items). Since n is nearly 
linear in c, this property is measured in terms of the storage ratio r = c -F n. 

In using random-access storage, the effect of a simple merge or a simple 
classification can be achieved rather easily through address modification. 
Hence there is little advantage in splitting either the merge or the column 
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sort into two separate phases, and attention will be limited to the single¬ 
phase processes. The two-phase processes can prove superior if efficient 
block transfer of data is available in the execution of the program, but 
their behavior should, in any event, be clear from the treatment of the 
analogous single-phase processes. 

Simple classification and merge 

An internal single-phase merge analogous to the serial single-phase 
merge of Program 6.5 can be based on the assignment of two matrices 1 X 
and 2 X to correspond to the sets of input and output files. The rows a X i of 
the matrices a X correspond to the files O/ of the serial process, each item 
being represented by a single matrix element. Items are read sequentially 
from and to fields; for each input field a X\ an index r j indicates the 
element a X r j to be read next; for each output field &- a )X\ an index s i 
indicates the element next to be specified. The items in each field occupy 
the leading positions in the field, and the index of the first unoccupied 
element in the input field a X j is indicated by the parameter t 5 . The process 
is described by Program 6.16. 

Program 6.16. As in Program 6.5, the vectors v and z specify exhausted fields 
(files) and ineligible fields (positions), respectively. The parameters a and t are 
initially specified external to the process. At the beginning of each stage the 
vector v (exhausted fields) is specified (step 3) according to the unit components 
of t. At each stage except the first, t is respecified (step 2) by the final value of s 
from the previous stage. When only one string remains, each component of s, 
save the first, will remain at its initial value of unity, and this condition is used to 
terminate the process at step 1. The vector k represents the keys of the current 
items in the m input fields and is initially specified (step 7) by the keys of the 
column vector a X 1 of initial items of the input fields. The remainder of the 
process is closely analogous to Program 6.5. 

If the total number of items to be sorted is n , then each of the 2m fields 
a X j must accommodate n items, and the total storage allocated must be 
2 nm. The storage ratio is therefore 2m. It can be reduced to two by 
putting the output in a single field and keeping a record of the beginning 
location of each successive set of s' = \s 4- m\ strings, where s is the 
maximum number of strings in any one input field. 

Let the input be represented by the single field X a , let b be a vector of 
dimension m + 1, whose /cth component specifies the beginning location 
of the k\h set of maximal strings, for k = 1,2 ,...,/, theyth set numbering 
possibly less than s', and let b jn be the location of the first unused position 
of the field. Obviously j < m, and if j < m, the remaining undefined 
components of b are immaterial. If the j sets are to serve as inputs to an 
m -way merge, then the prefix a m /b serves to initialize the vector r of 
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Program 6.16 Internal single-phase m-way merge using 2m fields 


current input indices, and the suffix c o m /b defines the terminating 
locations t. 

The strings produced by the merge from the sets of maximal strings in 
X a can be transferred without classification to the single output field 
X 3 ~ a . If the vector b is redefined by the beginning locations in the output 
field of successive sets of |V m\ items, it can be used to define the input 
fields in a subsequent merge from the field X s ~ a . Program 6.17 shows a 
convenient arrangement of the process. 
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Program 6.17. As in Program 6.16, the vector r determines the current set of 
items being examined, and t determines the terminal value of r. The first m 
components of b initialize r (step 7), and the last m components initialize t 
(step 8). The index u determines the component of the output vector X 3-a , which 
is next to be specified (step 24). 

The remainder of the process differs from Program 6.16 primarily in the deter¬ 
mination of b. The counter h allows i and b t to be respecified (steps 19-20) at 
the end of each group of s strings. The last specified component of b is deter¬ 
mined separately by step 5. Step 6 redetermines v for the next stage. 

Since some of the initial values s 0 , r°, and b° may be unknown, the 
initialization of the process (steps 1-3) merits some attention. If s 0 alone is 
known, v° and b° may be chosen as follows: v° = a 1 ; b^ = 1, b 2 ° = v(X). 
The effect is to perform the first merge from a single input area. Con¬ 
sequently, the first stage performs no rearrangement but does determine 
the vectors v and b. 

If s 0 is unknown, s may be determined by a preliminary string count. 
Alternatively, it may be set to any integer value y > s 0 . The process 
remains valid, but the required number of stages is increased by [log m y\ — 
n^g w s 0 i 

Since s 0 cannot exceed the number of items, the initial value s = v(X) is 
always valid, and for an assumed random distribution it exceeds the 
expected value by a factor of two only. If greater initial order is expected, 
it may be desirable to modify Program 6.17 to allow a small initial choice 
of s, accompanied by automatic respecification in the event that it proves 
too small. The modification may be simply made by inserting a branch on 
equality of / and m, following step 18. The branch would succeed only in 
the event that the initial specification of s were too small and should lead 
to a process for increasing (e.g., doubling) s and repeating the process. 

The case m = 2 is of especial interest, partly because it leads to signi¬ 
ficant simplification in the storage allocation, and partly because the 
advantages which large values of m enjoy in serial sorting largely disappear 
in internal sorting. These advantages arise from the reduction in the 
number of stages with increasing m , with a nearly constant time per stage 
due to the fixed reading and recording time for the serial files. In internal 
sorting, the time required for key comparisons and address calculations in 
choosing the minimum key becomes relatively more important and, since 
the key comparison time is an increasing function (frequently linear) of m , 
the advantage may lie with small values of m. 

The simplification of storage allocation arises in the following way. A 
two-way string classification on n items may be used in conjunction with a 
single output field with a total capacity of n items by assigning the odd- 
numbered strings from the beginning of the field in normal order and the 
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Program 6.18 Two-way internal single-phase merge 


even-numbered strings from the end of the field in reverse order. Thus if 
s j is theyth string of 2k strings so classified, the output field would contain 
the array 

s 1 © s 3 © s 5 © • • • ® s 2fc_1 ® © • • • © s 6 ® s 4 © s 2 , 

->. .<- 

where the arrows indicate the increasing directions of the associated 
strings. The restriction to an even number of strings in the foregoing 
example is clearly not essential. The corresponding two-way internal 
single-phase merge is described by Program 6.18. 

Program 6.18. Since the current index vectors r (for input) and s (for output) 
may always be initialized as shown in steps 3 and 4, and since termination of a 
phase occurs when r x exceeds r 2 (step 7), explicit use of the vectors b and t of 
Program 6.17 is obviated. The only added complication lies in the different 
treatment of indices r x and s l9 which must be incremented whenever used, and of 
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indices r 2 and s 2 , which must be decremented. This treatment is effected by 
addition (steps 16-17) of the rows Q } and Q l of the matrix 



It is interesting to note that the use of indexed variables to allow comparisons 
with the larger (or smaller) of two keys (steps 9 and 11) reduces the requisite 
number of comparisons from four to three. 

The method of string-doubling also permits some simplification in 
storage allocation and address calculations. 

Classification and simple merge 

As in the case of the internal merge sort, the internal column sort differs 
from the corresponding serial sort primarily in the problem of storage 
allocation. Again, the straightforward solution lies in the allocation of 2b 
fields of n item positions each, and the use of ^-dimensional indexing 
vectors r, s , and t to control the input and output fields. The sorting 
process used is identical with that of Program 6.8 (serial single-phase 
column sort), and the indexing problems are analogous to those of Pro¬ 
gram 6.16 (internal single-phase merge using 2m fields). 

As in the corresponding merge sort using 2m fields, the foregoing process 
has a high storage ratio which can be reduced to two by a two-field process. 
Unlike the corresponding case for the internal merge of Program 6.17, 
the explicit classification process cannot be avoided. Consequently, it is 
necessary to determine in advance the size of field required for each of the 
b classes corresponding to digits 0, 1,. . . , (b •— 1). This leads to the 
so-called pre-count column sort of Program 6.19, in which each stage 
incorporates an examination of the next higher order position of the key 
and a count of each of the digits occurring. 

Program 6.19. 0-origin indexing is used, and the vectors X a and X® (for a = 0 
or 1) serve as input and output fields. The classification on the key digit d } is 
performed so that all items which agree in the /th column of the key form an infix 
in the output X®, and so that the value of d } associated with successive infixes 
increases monotonically. The output indexing is determined by the vector s, 
which is, in turn, initialized by the vector b. The value of b for the succeeding 
stage is determined by steps 13 and 14*, according to the value of the next higher 
key digit dj_ v The initial value of b is assumed to be defined externally. It must 

* Statement 14 is, for most computers, an inefficient procedure for determining b. 
Normally it is preferable to make a simple count of each of the digits and to sum the 
counts to determine s at the beginning of the next stage. 
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Program 6.19 Pre-count column sort 


be determined by a preliminary count, perhaps performed when reading the 
items into the initial field. The use of q + 1 instead of q in steps 1 and 9 ensures 
that the pre-count quantity dj_ x is properly defined even for the final stage. 

Special internal sorting methods 

The present section is devoted to internal sorting methods which are 
specifically unsuited to serial files. The storage ratio provides a major 
categorization of methods; a method either possesses unity storage ratio 
or it does not. 

Unity storage ratio is achieved by methods which proceed by the inter¬ 
change of item pairs. The type of interchange may be limited to the 
transposition of adjacent items, to “insertion” of an item accompanied by 
a movement of all intervening items toward the evacuated position, or to 
the exchange of an arbitrary pair. The corresponding methods are 
characterized as transposition , insertion , and exchange methods , respec¬ 
tively. 

Exchange methods include the radix exchange sort . This is an arrange¬ 
ment of the repeated block sort for a base two key, for which the operation 
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count (number of elementary operations required to order n items) is of the 
order of n log 2 n. 

Transposition methods include the bubble sort , odd-even transposition , 
and the ranking sort. They are characterized by relatively simple programs, 
an operation count of the order of n 2 for random initial order, and the 
capacity to utilize existing order to reduce the operation count. 

The only insertion method treated is ranking by insertion. The operation 
count (counting comparisons only and not counting the individual item 
transfers of the block movements associated with each insertion) is of the 
order of n log 2 n for random order, and is reduced by existing order. It is 
most attractive in a computer providing efficient block movement of 
items. 

Methods having a storage ratio greater than unity include the merge and 
column sorts previously discussed. One additional method of this type is 
treated—the pth-degree repeated selection sort. The operation count is of 
the order pn ( ', where q = (p + 1) p. 

Any internal sorting method can be broken into two distinct phases, the 
first utilizing only the keys to determine the permutation required on 
the items, and the second effecting the permutation of the items. Since the 
permutation vector is, in effect, a table of addresses of the items, the 
process is called an address table sort. Address table sorting is particularly 
advantageous if the volume of data in the item is large compared to the 
data in its key. 

Any sorting method in which each stage isolates the item with the 
smallest key (among the items remaining from previous stages) can be 
modified to produce longer strings by the use of one auxiliary serial input 
file and one auxiliary serial output file. The modification is called sorting 
w ith replacement. It consists in recording the selected minimum item in the 
output file and reading from the input file a replacement item which enters 
in the subsequent stages only if it is eligible for continuation of the string 
already recorded. 

The internal methods are evaluated and compared in Sec. 6.5, and the 
results are summarized in Table 6.37. 

Radix exchange. Radix exchange is a form of the repeated block sort for a 
base two key. The high-order column of the key is scanned first. The first 
zero item (item with key digit zero) is exchanged with the last unit item, the 
second zero item is exchanged with the second last unit item, and so on, 
until the first stage of the block sort is completed. The zero items now 
form a prefix in the vector of items, and the unit items form a suffix. The 
process is then repeated on the next column of the key, first on the prefix 
obtained from the first stage, and then independently on the suffix. Each 
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column of the key is treated in turn, the exchange process being performed 
independently on each of the disjoint infix vectors defined by the preceding 
stage. 0-origin indexing will be used throughout the discussion. 

Primary control of the process is exercised by the vector b , whose 
successive components specify the beginning index of successive subblocks 
(infixes). In particular, b 0 == 0, and b v _ x — v(x ), and at the beginning of 
the kth stage, v(b) = 2 fc_1 + 1. The storage required for b is therefore 
significant but may be reduced by either of two expedients. The vector b 
may be replaced by a logical vector u such that u\ i° = b. Determination 
of the next component of b then requires a scan of the components of u. 
The use of u is thus normally unattractive except in a computer equipped 
for the efficient determination of the location of the leading nonzero 
component of a logical vector. The second expedient leads to a more 
complex but more practicable process. The straightforward use of b will 
be treated first. 

Program 6.20. Steps 10-21 perform the exchange on the subblock with indices 
k in the interval b t < k < b i+1 . The indices r 0 and r x designate the pair last 
exchanged, k is the index of the current item examined, / is the current column 
of the key, and a is an alternator which is zero during the forward scan of the zero 
section and unity during the backward scan of the unit section. The alternator a 
determines which of the indices r 0 and will initialize k (step 13), the direction 
of the scan (step 14), the type of key digit (0 or 1) sought in the next item to be 
exchanged (step 18), and which of the indices r 0 and r x is to be redefined by k 
(step 19) when the search terminates through failure of the branch at step 18. 
If a does not become zero from negation on step 20, the process is repeated from 
step 13 with a = 1, producing a backward search for a unit digit in the key. If a 
becomes zero, both the forward and backward scans have been completed, and 
the required item exchange is performed on step 21. The final exit from the entire 
loop 13-21 occurs when k — r & , that is, when k = r Q on a backward scan or 
k = r 1 on a forward scan. In either event, the final value of k is the beginning 
index of the new subblock defined by the exchange, and it is used immediately to 
specify c i on step 16. The vector c is eventually meshed with b (step 6) to re¬ 
specify b as (b 0 , c 0 , b l9 c l5 ...). The initial specification of b on step 1 ensures 
that the first subblock treated is the entire vector x. 

The number of subblocks which must be distinguished at any one time 
can be reduced to the dimension of the key by a method due to Hilde- 
brandt and Isbitz (1959). The process is controlled by a partition vector p 
of dimension q + 1, whose successive nonzero components specify the 
beginning indices of the subblocks of x remaining to be ordered. At each 
stage, the first remaining subblock is exchanged on the appropriate key 
digit d j9 i.e., for j increased by one over the value used in generating the 
particular subblock. When the exchange occurs fory = q — 1, the order¬ 
ing of the two leading subblocks is complete, and they are removed from 
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Program 6.20 Radix exchange with v(b) < 2 V 


further consideration by respecifying p 0 by p u and resetting p u and p v to 
zero, where p u and p v are the first nonzero components of B}/p. The end 
of the new leading subblock is now determined by p w , the new leading 
nonzero component of a}/p, and the exchange is executed on the appro¬ 
priate column j. 

Record of the value of j appropriate to a given subblock is kept by 
recording its terminal partition as p H . This is achieved, first, by recording 
each new partition generated by exchange on column j — 1 in component 
p Q _j , and, second, by advancing the component p w (determined in the 
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Program 6.21 Radix exchange with v(p) = v(d) + 1 

prefix removal phase) to p w _ x and resetting p w to zero. Incidentally, this 
procedure ensures that p u and p v always occur as p x and p 2 . In practice, 
these two components need not be reset if the scan for p w is begun with p 3 . 

Program 6.21. The reader may find it helpful to trace the program (i.e., record 
successive values of all parameters in tabular form) for a simple case of a few keys 
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of small dimension. The exchange phase (15-22) is identical with that of Program 
6.20 except that the specification of is omitted. If j f q — 1 at the conclusion 
of the exchange, / is incremented (step 10), r 0 is reset to p 0 — 1, and the new 
partition k specifies bothp^- and r lm If j = q — 1, the prefix removal is executed 
by steps 4-8. Step 4 respecifies p 0 , and the scan of steps 5-6 (which begins with 
p 3 ) locates a nonzero component p q _ 5 which is advanced to p Q __ {j+1) (indirectly by 
steps 7, 10, and 12) and is reset to zero by step 8. Steps 0-1 provide the initial 
specification of p 0 and (via step 12) of p Q . 

Bubble sort . The basic operation of the bubble sort is the comparison and 
possible transposition of a pair of adjacent items so as to place the smaller 
of the two keys earlier in the sequence. The first stage consists of such 
operations performed in sequence on the item pairs (x v _ v x t ), (x v _ 2 , 
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Table 6.22 Bubble sort example 


. . . , (x v x 2 ). The result is to bubble each item upward in the sequence 
until it encounters an item with a smaller (or equal) key and then to leave 
it and continue bubbling the new smaller item. In particular, the smallest 
item is bubbled to the top. Successive stages repeat the process, but since 
theyth stage brings theyth smallest item to theyth position, the (y + l)th 
stage need not treat the first j positions. It is clear that v(x) — 1 stages 
suffice, but it is advantageous to allow termination at the end of the first 
stage during which no transpositions occur. 

Table 6.22 shows the arrangements prevailing at the end of each stage 
of a bubble sort. The items above the staircase line are not re-examined. 
Although the items are in correct order at the end of stage three, there is 
no available indication of the fact until stage four is executed without the 
occurrence of a transposition. 

Program 6.23. The detailed behavior of the bubble sort process described by 
this program should be clear from the foregoing discussion. It may, however, be 
remarked that at most r = r(a) — 1 stages are executed, even though the final 
order is achieved only at the rth stage. 
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Program 6.23 Bubble sort 

If a given set of items is completely ordered except for a single item 
which is displaced upward from its proper position by j places, j stages of 
the bubble sort will be required to complete the ordering. On the other 
hand, a single stage of the bubble sort performed in the alternate direction 
(i.e., scanning from x ± to x v and bubbling the large items downward) 
would suffice. In general, there is some advantage in performing successive 
stages of the bubble sort in alternate directions. 

If on a backward scan (from x v to aq) no transposition occurs between 
items Xj and Xj_ v then x j and x j+1 are in correct relative order. 
Consequently, if x j and (the possibly new) Xj_ ± are not transposed on the 
succeeding forward scan, then x j and x j+1 will suffer no transposition. This 
result may be extended to strings of items which suffer no transpositions, 
and a record of this existing order can be used to obviate the corresponding 
comparisons. 

More precisely, if s is a logical vector such that Sj = 1 if and only if no 
transposition occurred between items x j and x^_ ± in a backward scan, then 
no transposition between items Xj_ ± and x j on the succeeding forward scan 
will also imply no transposition (i.e., no need for comparison) between x j 
and x j+1 if Sj = 1. More generally, if a = ±1 is an alternator such that 
a = 1 on the forward scan, then Sj may be defined as unity if and only if 
no transposition occurs between x j and x j+a . Program 6.24 shows the 
entire process. This variant of the bubble sort is attractive only for 
computers in which the indicated scan of the logical vector can be per¬ 
formed efficiently. 
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Program 6.24. Multiplication of each key by the alternator a provides the 
required alternation in the behavior of the branch on step 10. The behavior is 
best appreciated by tracing a simple case. 
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Program 6.24 Forward and backward bubble sort with string indicator 


Odd-even transposition sort. Like the bubble sort, the transposition sort 
has unity storage ratio and involves the comparison and possible trans¬ 
position of adjacent items. Each stage consists of two phases or half-stages. 
In the first half-stage, each item with an odd index (except the last if it is 
odd) is compared with its successor; in the second, each item with an odd 
index (except the first) is compared with its predecessor. Table 6.25 
provides an example. 
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Table 6.25 Odd-even transposition sort example 
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Program 6.26. The subloop 6-10 performs the first or the second half-stage of 
the transposition sort according as the alternator a is 0 or 1. Final termination 
occurs on step 3 as soon as one half-stage is completed without a transposition 
occurring, except that a minimum of two half-stages must be executed. The 
minimum of two half-stages is assured by the initialization of t on step 1. 
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Program 6.26 Odd-even transposition sort 


The validity of the termination conditions of Program 6.26 follows from 
the following proposition: if any half-stage except the first effects no 
transpositions, the items are completely ordered. If the half-stage is odd, 
then 

k( x i) ^ k( x 2)l k( x s) ^ k( x *)l • • • • 

Since no transpositions occur, the conditions of the previous stage (which 
exists by hypothesis) also remain valid, i.e., 

k(x 2 ) < k(x^) ^ k(x^) > • • • • 

The two sets of inequalities together imply ordering.* A similar argument 
applies for the case of an even half-stage. 

In Sec. 6.5 the transposition sort is shown to be less efficient than the 
bubble sort. However, it enjoys the unique advantage that all comparisons 

* This result may be combined with the fact (established in Sec. 6.5) that the number 
of transpositions required is finite to establish convergence of the transposition sort. 
For, if the set is not ordered, each half-stage must effect at least one transposition. 
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and transpositions in a given half-stage are independent and may therefore 
be executed in parallel. 

Ranking sort . If one new item is added to a vector of items already ordered 
on a given key, the resulting vector can be ordered by ranking the new 
item, i.e., by comparing it with the items of the original vector in succession 
until the appropriate rank of the added item is determined. Moreover, n 
repetitions of this process, which draw the new items from a given vector 
of n items, will order the entire vector. Table 6.27 shows an example in 
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Table 6.27 Ranking sort example 


which the individual ranking operations are each performed by comparing 
the added item with the ranked items in turn (starting with the largest), 
moving forward by one place each item whose key exceeds that of the 
added item. 

Program 6.28. The index / controls the selection of successive items to define 
the item 2 (step 4) which is to be added to the ranked set by steps 6-9. The index / 
controls the selection of successive items of the ranked set for comparison with 2 , 
and each execution of the subloop terminates if either / becomes zero or if 
k(z) > k(Xj). In either event, step 9 inserts the new item 2 into the position x j+1 , 
which was last evacuated. From the initialization of the index / it is clear that the 
process is actually completed in K*) — 1 rather than v(x) stages. 

Ranking by insertion. Since each stage of the ranking sort ranks one new 
item in an already ranked set, the determination of its position can be 
accomplished by a binary search. This sharply reduces the required 
number of comparisons. However, once the new position is determined, 
each of the succeeding items in the ranked set must be moved down one 
place to make way for the new item to be inserted. This method is par¬ 
ticularly good where (due either to suitable facilities in the computer or to 
the use of a chained representation for the vector of items x) such block 
transfer is easy to perform. 
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Program 6.29 Ranking by insertion 
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Program 6.29. The binary search is performed by the loop 11-15, which so 
determines j that the new item x i is to be inserted after x jm Since the floor opera¬ 
tion is used on step 12,y will eventually reach the value /?, and step 14 therefore 
terminates the loop in a finite number of steps. Step 3 performs the required 
insertion by a right rotation of the infix of x bounded by Xj and x t . The cases 
where x t lies outside the previously ranked set are treated by the comparisons on 
steps 7 and 9. Incidentally, step 7 takes full advantage of any initial order in the 
items, e.g., if the set is initially ordered, steps 8-15 are never executed. 

Repeated selection sort. The process of scanning all items of a vector for 
the smallest key and transferring the selected item to a separate output 
area (in a serial or random access file) will be called selection. Repeated 
selection on the successive remaining (unselected) items will serve to 
transfer the items in the order determined by the specified key. This 
method of ordering is called a ( first-degree) selection sort. 

If the given vector of n items is divided into m subvectors of at most 
\n — m] items each, then a selection from each subvector will produce a 
vector of m items, the /th of which is the smallest of theyth subvector. A 
selection performed on the m items of the resulting vector will then select 
the smallest item of the entire original set. If the selected item came 
from the k\h subvector, it is then replaced by a further selection on the 
remaining items of the kih subvector. Repetition of the process n times 
serves to order the items. Because selection is performed on two levels, 
the process is called second-degree selection. 

In general, the smallest (first-level) item may be selected from a set of v 2 
second-level items , each of which is selected as the smallest of v 3 third-level 
items. The process can clearly be extended to any desired number of levels. 
If p levels of selection are used, the process is termed pth-degree selection or 
repeated selection. It may be represented as a singular homogeneous tree 
T of height p + 1, as illustrated by Fig. 6.30. 

Figure 6.30 shows the initial filling of the lower levels in a third-degree 
selection sort performed on the sixteen items at the top of the tree T, with 
v(T) = (1, 2, 2, 4). The keys are indicated in parentheses. The positions 
of the third level are the nodes (1, 1, 1), (1, 1,2), (1,2, 1), and (1,2, 2). 
They are first filled by items # 3 (6), # 6 (2), # 9 (1), and # 14 (4), respectively, each 
selected as the smallest among the second-level nodes of the corresponding 
subtrees T (111) , T (112) , T (121) , and T (12 2) , respectively. The first position 
of level two is then filled by # 6 (2), selected as the smallest among the second- 
level nodes in its subtree, and so forth. Figure 6.31 shows the continuation 
of the process through the selection of the first two output items. 

If e is some value which exceeds the absolute value of all keys, then the 
selection process may be made more uniform by assuming either that the 
entire top level of the tree is occupied by the items to be sorted or that 
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Figure 6.30 The tree T representing the third-degree selection sort for v(T) = 
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Figure 6.31 Continuation of the third-degree selection sort 
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unoccupied positions are filled with items having the key value e , and by 
replacing each item selected from the top level by an item with a similar 
key value. Termination of the process may then be determined by the 
appearance of such a key value at the output. Moreover, the initial filling 
may be simplified by filling all lower levels with dummy items having key 
values of (— e). These may be recognized and discarded at the output. 
The normal process will remove all the dummy items first, leaving all 
levels in the state which would be produced by a special initial fill. 

Program 6.32. The top level of the tree T initially contains the items to be 
sorted (completed if necessary by dummy items having the key value of e), and 
the remaining levels contain items with the key value (— e). The index vector s 
scans the second-level nodes of the subtree T-i^ (steps 14-18) to determine the 
index m of the node having the minimum key z. If z ^ e , step 10 replaces the 
root of the subtree by the selected node value and step 11 respecifies s to begin 
the scan of the subtree rooted in m. If z = e , then all second-level nodes contain 
dummy items with “infinite keys,” and step 9 branches to step 1 to replace the 
root of the subtree by a dummy item as well. The branch from step 12 occurs 
when the scan of the top level has been completed; it also results in the insertion 
of a dummy item. 

Since each complete scan (over all levels) begins with s = (1, 1) (steps 3,11,15), 
the resulting minimum item is brought to the root of the tree. Step 2 specifies z 
as its key, and steps 4 and 7 determine its disposal. If z = e, all legitimate items 
have been flushed from the tree, and the branch to step 5 terminates the output 
file, rewinds it, and ends the process. If z = — e, the item is a dummy initial fill 
and is discarded by skipping the recording of the output file on step 8. 

Since the selection process proceeds by levels in the tree, a corresponding 
computer program can best be based on a right list—specifically, on the 
right list node vector c 2 /]T and the dispersion vector v(T). The computa¬ 
tion of the list index r(s) required in the path tracing is described (for 
0 -origin indexing) by the recursion on the functions/and g developed in 
Sec. 1.23. This recursion yields a relatively simple computer program for 
a general homogeneous tree. It will be shown, however, that a 6-way 
rooted tree (i.e., a rooted tree with a common branching ratio b) is the case 
of greatest practical interest, and in this case the simpler recursion 

r{s) = b X r^/s) + 1 + s v 

(also developed in Sec. 1.23) can be used. Program 6.33 shows the repeated 
selection sort of Program 6.32 executed on the right node list vector r of a 
6 -way rooted tree. 

Program 6.33. The initial conditions are as assumed for Program 6.32, and the 
steps of the two programs correspond very closely. The simple index modifica¬ 
tion required from stage to stage is shown in step 11. 
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Program 6.32 Repeated selection sort 


The superiority of a common branching ratio is demonstrated as follows. 
The number of items scanned per item selected (ignoring initial fill and 
termination) is clearly the sum of the branching ratios, whereas the maxi¬ 
mum number of items accommodated in the top level is equal to their 
product. It is therefore easily shown (e.g., by induction or by the use of 
Lagrange multipliers*) that for a fixed number of items, optimum execution 
is furnished by a common branching ratio. 

Sorting with replacement . Certain of the internal sorting processes 
discussed (bubble and repeated selection) proceed in a succession of 
stages, each of which results in the selection of the smallest remaining 

* See, for example, Margenau and Murphy (1943), p. 205. 
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item. Since this item can be transferred immediately to an output area or 
serial output file, the evacuated position can be refilled by an item from a 
serial input file. Each output item can therefore be replaced by a new item 
from a serial input, and the resulting process is called sorting with replace¬ 
ment. 

If the key of the newly introduced item exceeds or equals the key of the 
last output item of a group, the new item may be treated as a member of 
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Program 6.33 Repeated selection sort of Program 6.32 executed on the right 
list node vector r of a uniform rooted b -way tree 
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Program 6.34 Repeated selection sort with replacement 
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the original group. If not, the item must be allowed to retain its position 
but must be excluded from consideration in the sorting process. In general, 
sorting with replacement allows the production of longer output strings 
than would be otherwise possible with the same internal storage capacity. 
The expected increase is discussed in Sec. 6.5. Since the process terminates 
with the original positions completely occupied by ineligible items, the 
production of a further string can be begun simply by declaring all items 
eligible. 

Repeated selection sort with replacement. In the repeated selection sort it is 
advantageous to apply the eligibility criterion at each level, i.e., to limit 
selection to keys which equal or exceed the key of the item being replaced. 
The item being replaced is, of course, the last one transferred (either to the 
output file or to the preceding level). The top level items are replaced from 
the input file or, when the file becomes exhausted, by dummy items. 
However, the use of the “infinite” dummy key value e as in Program 6.32 
raises serious difficulties, which are avoided by the use of the value 
—(e + 1). This is done in Program 6.34. 

Program 6.34. This program is very similar to Program 6.32, and only the 
essential differences will be remarked. The main scan loop (15-20) differs only in 
the added comparison with w to prevent the selection of ineligible items. The 
variable w is normally specified (step 13) as the key of the item just transferred 
out of the position being filled. However, if all items are ineligible, then 2 
remains unchanged from its initial value established by step 14, and the branch 
from step 21 to 22 occurs. The variable w is then set to — e to make eligible all 
items except the dummy fills [with key value (—(e + 1))], which enter on exhaus¬ 
tion of the input file. If only these dummies remain in the level scanned, the 
process returns again to step 22. This time, however, the branch to step 3 occurs. 

Step 3 is the file read (which replaces step 1 of Program 6.32). When the file 
becomes exhausted, the branch to step 2 replaces program statement p 3 by p l9 so 
that step 3 thereafter provides the requisite dummy keys. 

Bubble sort with replacement. A straightforward bubble sort with re¬ 
placement produces the same length strings as a first-order selection sort 
with replacement, and, indeed, differs from it mainly in the additional 
performance of item interchanges. Bubble sorting with replacement is 
therefore of interest only if the order induced in the remaining items by the 
interchanges can be used to reduce the number of items scanned. This can 
be achieved by accumulating the ineligible items in a growing suffix of the 
vector of items and restricting successive scans to the prefix of eligible 
items. This method is shown in Program 6.35. 

Program 6.35. In the loop 11-13, w denotes the current item with the smallest 
key, and it is interchanged with x i if k(w) exceeds &(**). At step 14, w is therefore 
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Program 6.35 Bubble sort with replacement 


the item with the smallest key among those scanned and (unless k(w) = e) is then 
recorded in the output file and used to define 2 , the key of the last recorded item. 
The main loop is then repeated from step 5, where w is respecified by reading 
an item from the input file. If the new item is eligible, the branch from step 6 to 
step 10 is followed; if not, the parameter j (denoting the index of the first in¬ 
eligible item Xj) is decremented and w is interchanged with the last eligible item x s 
before continuing to step 10. Since step 10 initializes i to the value j, the sub¬ 
sequent scan is limited to eligible items. 

If j becomes zero on step 7, the entire set of items including w is ineligible. 
Step 8 then restores j so as to make all items eligible and hence to start a new 
string. 

The branch from step 5 and the subsequent redefinition of step 5 on step 2 
serve to introduce “infinite” keys after the input file is exhausted. The immediate 
redefinition of j and z (steps 3 and 4) to make al) items eligible may appear 
redundant. It may be verified that this procedure (1) does not increase the number 
of strings produced, and (2) avoids the potential error of omitting the last string. 
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Address table sort . A sorting process can, in principle, be accomplished in 
two distinct phases, a determination of the permutation required and the 
execution of the permutation. In serial sorting, however, the process 
cannot be effectively divided in this way. For, because of the limitation 
to rank-preserving operations, the items could not be transferred directly 
to their final positions even if they were known. In internal sorting this 
limitation does not apply. A sorting process embodying these two separate 
phases is called an address table sort. The method offers advantages when 
the time required to transfer a complete item is large compared to the time 
required to transfer its key. 

The permutation p must be so determined that the permuted vector 
y = x p is ordered on the key. Let K 1 be the vector of keys associated with 
the given vector x, i.e., Kl = k(x t ), and let K 2 be the identity permutation 
i 1 . For example, if the successive keys of the vector x are 17, 9, 6, 11,4, 8, 
and 3, then 

17 9 6 11 4 8 3 

1 2 3 4 5 6 7 

If the columns of K are reordered (by any desired sorting process), to 
produce the matrix P such that P3 is monotone increasing in z, then 
p = P 2 is the desired permutation vector. In terms of the foregoing 
example, 

/3 4 6 8 9 11 17 

” \7 5 3 6 2 4 1 

and p = (7, 5, 3, 6, 2, 4, 1). 

6.5 EVALUATION OF INTERNAL SORTING 
METHODS 

In evaluating internal sorting methods, both the execution time and the 
internal storage requirements must be considered. For the execution 
times of the internal merge sort and the internal column sort, the analysis 
of the corresponding serial sorting process applies directly. Since their 
storage requirements have also been discussed in Sec. 6.4, the present 
section will be limited to the special internal sorting methods. 

The measures of interest in the evaluation of these methods are four: 

1 . the scan length 

2 . the number of stages required 

3. the number of transpositions required 

4. the storage ratio. 
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The number of comparisons executed is frequently used as a measure of 
execution time of a sorting process. However, since the selection of the 
keys to be compared (involving instruction modification and possibly 
extraction of key digits) usually accounts for the largest share of the time 
required in a comparison, the scan length (number of items scanned) is a 
more suitable measure. In a process which scans m items serially (as in 
each individual stage of a bubble or a ranking sort), the number of com¬ 
parisons is (m — 1) and differs but little from m. However, for m small 
(as in the successive selections in a repeated selection sort),* the ratio 
m -i- (m — 1) is significant, and the scan length is the better measure. 
Moreover, in a process which does not use a serial scan, the number of 
comparisons may differ markedly from the scan length. For example, 
each half-stage of the interchange sort requires approximately m/2 
comparisons for a scan of m items. 

The storage ratio is defined as the ratio of the number of item storage 
locations required to the number of items entering the sorting process 
(without replacement). The four measures will be determined for each 
method in turn and then compared. Table 6.37 summarizes the main 
results for an assumed random initial distribution of keys. The entire 
analysis is based on the assumption of distinct keys. 


Expected number of transpositions 


Let y(ri) be the total number of transpositions required to order all of 
the n\ possible arrangements of n items. Since the ( n + l)th item added 
in the last position may rank first, second, . . ., or last, requiring n, 
(n — 1), . . . , of 0 additional transpositions, respectively, for each of the 
n \ arrangements of the n items, then 

n 

y(n + 1) = (n + 1 )y(n) + n! £ k 

k= 0 

= (n + 1 )y(n) + Q(n + 1)!. 

The function 


y(n) = 


nl (n 2 — n) 


4 


satisfies this difference equation as well as the obvious boundary condition 
y{ 0) = 0, and the expected number of transpositions is therefore given by 

y(n) _ (ft 2 — n) 
ft! 4 

The maximum and minimum number of transpositions are (ft 2 — n)/2 and 
zero, respectively. 
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A pair of items whose relative order differs from the final arrangement 
will be called a disordered pair. Thus in the set (2, 4, 3, 1), the disordered 
pairs are (2, 1), (4, 3), (4, 1), and (3, 1). To establish order it is clearly 
necessary that each disordered pair be transposed. The sequential 
transposition of disordered pairs of adjacent items is also sufficient to 
produce ordering. Any transposition method which transposes only 
disordered pairs therefore achieves order with a minimum number of trans¬ 
positions and hence with an expected number of (n 2 — n)l 4. The ranking 
sort, bubble sort, and interchange sort all fall into this category. The 
sequence in which the transpositions occur will, however, normally vary 
for different methods. Thus the sequence of arrangements realized in 
reordering the set (2, 4, 3, 1) is (2, 4, 1, 3), (2, 1, 4, 3), (1,2, 4, 3), and 
(1, 2, 3, 4) for the bubble sort, and (2, 3, 4, 1), (2, 3, 1, 4), (2, 1, 3, 4), and 
(1, 2, 3, 4) for the ranking sort. 

Bubble sort 

In the bubble sort, both the number of stages and the scan length depend 
on the initial order. The minimum and maximum number of stages 
required are one and (n — 1), respectively. The minimum and maximum 
scan lengths are n and (n 2 + n — 2)/2, respectively. 

The expected number of stages required for a random initial order is 
determined* as follows. If, at any stage, all items are ordered except for 
one which occupies a position below (later in the sequence than) its proper 
position, then one further stage will complete the ordering. On the other 
hand, each item which appears above its terminal position at a given stage 
will be moved down by exactly one position in the next stage. Conse¬ 
quently, the number of stages required will be determined by d, the 
maximum upward displacement from its final position of any item in 
the original array. More precisely, if x is the given vector of items, y the 
corresponding vector of ordered items, p is a permutation such that 
y — x p , and d = i l — p, then d = ((e\d)/d) v 

If the maximum upward displacement is d , then (assuming that all 
keys are distinct and that the final order is consequently unique) the last 
item in the final order (i.e., y n ) can initially occupy any one of the (d + 1) 
components of the suffix io d+1 /x. Similarly, y n _ x may initially occupy any 
one of the remaining (d + 1) positions of the suffix co d + 2 /#, and so on for 
each of the (n — d) items of the suffix c o n ~ d jy. The number of possible 
initial arrangements of the items of c o n ~ d /y is therefore (d + l) (n_d) . Since 
the Pleading items (i.e., a d /y ) can occupy any one of the d remaining posi¬ 
tions without restriction, the number of possible initial arrangements with a 


* The exact expression for the number of stages is due to Friend (1956). 
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n 

c(n) 

1 

0.798 

5 

0.853 

10 

! 0.877 

15 

0.893 

20 

0.904 

30 

0.918 

40 

0.927 

50 

0.934 


Table 6.36 Coefficients for z(n) = c(n)Vnn/l 


displacement not exceeding d is clearly d\ (d + l) (n-d) . The probability 
that the maximum displacement r does not exceed d is therefore given by 


Similarly, 


and hence 


pr(r < d) = 


(d + 1 ) {n - d) d\ 


n\ 


pr(r < d — 1) 


_ d {n ~ d+1 \d - 1 )! 


n\ 


pr(r = d) = pr(r < d) — pr(r < d — 1) 


= (</ + \) {n ~ d) d\ - d {n ~ d+1 \d - 1)! 
n! 

The expected value of r is given by 

71 — 1 71 — 1 

= 2 d X pr(r = d) = ^ d x pr(r = d) 

d=0 d =1 

= -(V d(d + i) (n - <i) d\ - V d x d {n - a+1 \d - 1)! 

n!\d=i rf=i 


A change of dummy variable in the second summand (d = t + 1) brings 
the two summands to similar form and yields the result 

e T = n - z(n), 

where z(n) = — 2 s n-s s!. 

n\ s= 1 


It is shown in the appendix to this chapter that z(n) approaches the value 
V 7772/2 for large n, and Table 6.36 gives coefficients c(n) for the approxi¬ 
mation 


z(n) = c(n)V 7772/2. 
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Since one extra stage is needed to determine that the ordering has been 
completed, the expected number of stages is given by 

e Q = n + 1 — z(n). 

Since the scan length in the /th stage of the bubble sort is known to be 
n — j + 1, a similar analysis can be used to determine the expected total 
scan length e s . The result is 


2 

e $ = - - ^ -K« + 1 )z(n + 1) — nz(n), 

or approximately, 



Ranking sort 

The number of stages in the ranking sort is clearly (n — 1). The ex¬ 
pected scan length on the A:th stage is determined as follows. The item may 
rank in any one of (k + 1) positions; first, second, . . . , last, with equal 
probability. There are therefore (k + 1) cases requiring scan lengths of 
2, 3, . . . , k, (k + 1), (k + 1). The last case requires a scan length of 
{k + 1) rather than (k + 2), since the process terminates on comparison 
with the last item regardless of the outcome of the comparison. The total 
scan length for the (k + 1) cases is therefore 

/V \ , If + 54 + 2 

I?/)"'-2-• 


and the expected scan length is consequently 2 + k/2 — 
Summing over the (n — 1) stages and denoting the expected 
length by e yields the relation 


1 n-l n — 1 1 

e-20,-1) + - I k- 2 7—7 

2 k=i jt=i k + 1 

_ n 2 + In - 4 _ y 1 
4 k =ik' 


\/(k + 1 ). 
total scan 


But 


V 


1 

tr = 1 


= y + log, n + 


2 n 


1 

12 n 2 


approximately,* where y — 0.5772 • • • is Euler's constant. Hence 
n 2 + 7n 11 

e " —7-'- 577 - l°8<- n - — + — , 

4 In 12n - 


See, for example, Cramer (1951) p. 125, or Woods (1926) p. 171. 
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correct to two decimal places for all n > 1. The maximum and minimum 
scan lengths are (n 2 + n — 2)/2 and 2 (n — 1), respectively. 

The ranking sort takes advantage of initial order and the minimum scan 
length of 2 (n — 1) is achieved for an initially ordered set of items. Ranking 
by insertion with binary search, as described by Program 6.29, requires 
approximately 2 + flog 2 /| comparisons on theyth stage. Hence if n — 2 k , 
the number of comparisons required is given by 

e = (2 + k)2 k - (2 + 2 2 + 2 3 + • • • + 2 k ~ l ) 

= (2 + k)2 k - 2 k = (k + 1)2*. 

In general, then, 

e= n(\og 2 n + 1). 

For a random distribution, ranking by insertion with binary search 
requires fewer comparisons than any other method, and, in the form 

described by Program 6.29, also takes advantage of initial order. The 

insertion operation requires, of course, a number of rotations of relatively 
lengthy vectors. 

Odd-even transposition sort 

Estimates of the efficiency of the transposition sort may be obtained as 
follows. Each half-stage requires the scanning of (approximately) n items 
in n/2 comparisons, and results in reducing the displacement (either up or 
down) of each item by at most one. The fact that the reduction in the 
displacement may be zero for certain items can be illustrated with the 
initial arrangement (5, 6, 1, 2, 3, 4). The number of half-stages must 
therefore equal or exceed the maximum displacement, which, in turn, 
equals or exceeds the maximum upward displacement d used in the analysis 
of the bubble sort. Moreover, one final half-stage is required to determine 
that order has been achieved, and the expected number of half-stages will 
necessarily exceed the corresponding value obtained for the bubble sort, 
namely, n + 1 — z(n). Since the number of items scanned per stage in the 
transposition sort exceeds the corresponding number in the bubble sort, it 
follows that the transposition sort is much less efficient. Moreover, the 
transposition sort does not allow sorting with replacement. Its sole 
advantage resides in the possibility of executing all operations in a given 
half-stage in parallel. 

Repeated selection sort 

The number of items scanned per item selected in a /?th-degree selection 
sort is equal to the sum of the branching ratios of the associated tree, and, 
as already demonstrated, a minimum scan length is provided by a tree 
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with a common branching ratio m. Since m p > n, the number of items 
scanned per output item is given approximately by 

s — mp = m log m n. 

The resulting expression for the total number of items scanned (nm log w n) 
is similar in form to the corresponding result for the m-way merge sort. 
The indicated optimum choice of m is Napier’s number e. 

The optimum integral value of m is three, and its efficiency differs from 
the theoretical optimum by less than 1 %. The value m = 2 simplifies the 
required program and increases the expected amount of scan by only about 
5 %. This case (m = 2) is referred to as a tournament sort. Larger values of 
m may prove advantageous where the groups of items to be scanned are 
contained in a serial store whose scan time is not significantly reduced by 
reducing m. 

Since p item transfers are required per item selected, a more realistic cost 
function for determining the optimum value of m may be given by the 
function 

c = (m + a)p = (m + a ) log m n , 

where a is the ratio of the time required for an item transfer to the time 
required for the scan of a single key. As remarked in Sec. 6.2, the optimum 
value of m is obtained as a solution of the equation 

log e m — 1 + — . 

m 


The amount of item storage required for a /?th-degree selection sort is 
given by 


a;=l + m + m 2 + *- * + m ?> = 


m p+1 - 1 
m - 1 


mn 



since n = m v . The storage ratio r(m) is therefore given by r(m) = 
ml(m — 1), a function which decreases with m, rapidly approaching unity. 
This ratio also represents the increase in execution time entailed in the 
initialization of the lower level positions. If sorting with replacement is 
used, initialization is required for the first string only. 

The ratio r(m) changes significantly for the first few values; thus 

r( 2) = 2.00; r( 3) = 1.50; r( 4) = 1.33. 

If the expected scan time s(m) is taken as a second criterion, then (since 
s( 2) = *(4) > s(3)), the value m = 2 will be eclipsed by both 3 and 4. 
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Sorting with replacement 

The use of replacement with any internal sorting method offers the 
advantage of increasing the expected length of the strings produced. 
Gassner (1958) has shown that for n item positions and random keys, the 
expected length of the first string is {e — 1) x n = 1.718/?, and that for 
later strings the expected length rapidly approaches In. 

Comparison of internal sorting methods 

The results of the preceding sections are summarized in Table 6.37. The 
ranking, bubble, and transposition processes show the most favorable 
storage ratio, exactly one half of the best attainable in the merge and the 
column sort processes. In the case of repeated selection, the storage ratio 
depends on m and ranges from two to a limit of one. 

The execution time is approximately proportional to the function 

/(«) = S + ct, 

where s is the scan length, t the number of item transfers or transpositions, 
and c the ratio of the cost (in time) of one item transfer to the cost of a scan 
of one key. For any given method with variable parameters (such as the 
repeated selection sort), these parameters may be chosen so as to minimize 
f(n). The choice between various methods may then be made (subject to 
storage considerations) so as to minimize f{ri). 

Since the ranking, bubble, and odd-even transposition methods share 
the same number of transpositions, the choice between them depends on 
the scan length and auxiliary factors. The odd-even transposition sort is 
inferior to the bubble sort in this regard, and the bubble sort is, in turn, 
inferior (by a factor of two for large n) to the ranking sort for n > 8. The 
bubble sort retains the advantage that sorting with replacement may be 
used, and the transposition method allows parallel operation. 

APPENDIX TO CHAPTER 6 

The following derivation of the limit of the function 

«(«) = — 2 s! s n ~ s 
n\ s= i 

arising in the evaluation of the bubble sort was suggested by Robert 
Church. Clearly 

L(n) = I g(s) ds < z(n) <U{n)= f #(s + 1) ds, 

Jo Jo 
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$! 

where ^(O = — s n ~ s . Moreover, 


g(s) ds, 


and since ^( 5 -) is monotone increasing with a large positive derivative for 
large values of s, only the upper end of the integral is significant. It can 
therefore be shown that 

lim U(ri) = lim L(n). 

n-* 00 n-*co 

Consequently, 

lim L(n) = lim z(n). 


Applying Stirling’s formula, 

s+5 Vy- s ds 


lim L(n ) 


^ ( n j2irs^ 
Jo J2irn 


y j2im n ^ A e- n 
Setting t — \ — s\n yields 

lim L{n) = e n n- {n+ ' A) 


= e n 


f 


-(n+ 14 ) | c n+ 1 / ^e -s ds 


\ n n + ' A ( 1 - 0” 
Jo 

= n f 1 (1 — t) n + ' A e nt dt, 

Jo 

= n fV‘ + (n+ 

Jo 


+ ' A e~ n e nt n dt 


hH)log e (i-<) 


Expanding log e (1—0 yields 

lim L(n) = n f g-i/*-<.+H>«W/s+ •• •) dt 

it-* 00 *0 

For n large, only small values of t will be significant, and all terms in the 
exponent may be dropped except — (n + l/2)r 2 /2. Similarly, the upper 
limit of integration may be extended to infinity. Hence, 

lim L(n) = n f e~ {n + A)t2/2 dt = n(7r/(2n + 1))' A 

n~* 00 Jo 

= (nnllj A . 
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EXERCISES 

6.1 A given file contains a set of 15 items x u ..., x 15 with associated decimal 

keys 68, 9, 60, 14, 60, 73, 79, 15, 67, 5, 9, 41, 57, 9, 41. For each of the serial files 

used, show the contents at the conclusion of each stage of the following processes: 

(a) a two-phase classification and merge using 

(i) four files. 

(ii) three files. 

(iii) six files. 

(b) a string classification and merge using four files. 

(c) a single-phase merge using 

(i) four files. 

(ii) six files. 

(d) a single-phase merge without rewind. 

(e) a base ten column sort using eleven files. 

(f) a base ten column sort using twenty files. 

(g) a base ten column sort using four files and the partial pass column sort. 

(h) a column sort using four files and a base three representation of the keys. 

6.2 Modify Program 6.4 so that it will work properly without dummy terminal 

items (i.e., each terminal partition X 2 is to be associated with a legitimate item). 

6.3 Write a program for the string-doubling merge sort. 

6.4 (a) Write a program for a base b column sort which uses backward read to 

eliminate rewinding. 

(b) Program a variant of the two-phase column sort (6 + 1 files) so as to 
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eliminate the copying of the “zero items” in each merge phase. Deter¬ 
mine the relative efficiency of the method. 

(c) Program an analogous variant of the ra-way two-phase merge sort so as 
to eliminate the copying of part of the items in the classification phase. 

6.5 Construct the matrix M (cf. Table 6.10) specifying an efficient partial pass 
column sort for the following cases: 

(a) base ten and three files. 

(b) base ten and five files. 

(c) base eight and three files. 

(d) base ten and four files using no rewind, i.e., files are to be read backward. 

(e) base ten and three files using no rewind. 

6.6 Using a set of matrices 1 M, 2 M,..., of the form of Table 6.10, write a 
program to generalize the partial pass sort to the case of a mixed radix key. 

6.7 (a) Reprogram the amphisbaenic sort (Program 6.13) so as to reverse the 

order of the final output. (This is the form used in Nagler, 1959.) 

(b) Program a generalization of the amphisbaenic sort which makes use of 
partial passes within columns of the key. 

6.8 (a) Program a modification of the bubble sort process which on odd- 

numbered stages bubbles the smallest item to the leading end and on 
even-numbered stages bubbles the largest item to the tail end. 

(b) By examining all 4! cases show that for four items the expected number 
of stages is slightly less for the method of part (a) than for the un¬ 
modified bubble sort. 

(c) Program a bubble sort using a string indicator s as in Program 6.24 but 
using backward scan only. 

6.9 (a) Program a modification of Program 6.17 to specify s 0 = 2 and to auto¬ 

matically double s 0 and restart if necessary. 

(b) Compare the efficiency of the program of part (a) with that of the 
straightforward program in which the number s of strings is assumed 
equal to the number of items. 

6.10 (a) Derive the relation log e m = 1 4- aim which must be satisfied by an 

optimal value of m in order to minimize the cost function c = (m + a) 
log w n (cf. Sec. 6.2). 

(b) Determine the optimal integral value of m for each of the cases a = 0, 

he 2 . 

6.11 For the amphisbaenic sort on a set of b l items with distinct keys belonging 
to the set determine 

(a) the number of item transfers required. 

(b) the number of file reversals (change of direction of read and record) 
required. 

6.12 Write a program describing the odd-even transposition sort in terms of 
vector operations so as to show clearly the parallel nature of the process. Treat 
all items with odd indices (plus a dummy item) as a vector and all even items (plus 
a dummy item) as a second vector. 
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6.13 (a) For each of the following sorting methods, determine whether initial 

order of the items is preserved, i.e., whether the relative initial order of 
all item pairs with duplicate keys is preserved. 

(i) simple classification and merge. 

(ii) classification and simple merge. 

(iii) ranking sort. 

(iv) bubble sort. 

(v) odd-even transposition. 

(vi) radix exchange sort. 

(b) Prescribe a simple modification of the key which will ensure the preser¬ 
vation of initial order. 

6.14 For the sequence of keys given in Exercise 6.1, show the explicit execution 
of the following internal sorting processes: 

(a) bubble sort with replacement (six item positions). 

(b) /?th-degree selection sort with replacement, with p = 3 and m — 2. 

(c) ranking by insertion. 

6.15 If the key is represented in a base b number system, with each digit repre¬ 
sented in turn in a ranked binary code, then ordering can be achieved by a base 
two column sort on successive columns of the underlying binary representation. 

(a) Show more generally that ordering can be achieved by a base 2 k column 
sort on (the base two value of) successive groups of k binary digits. 

(b) Program the process suggested in part (a), including the determination of 
an optimum value of k for a given number n of available files. Assume an 
original key of q digits, each represented by r binary components. Do not 
neglect the problem of terminal conditions. 

(c) If b = 10, r = 4, and if the successive decimal digits are coded alternately 
in a (ranked) bi-quinary and qui-binary system, the binary digits can be 
grouped in twos and threes so as to allow column sorting with a maximum 
of five output files. Program a suitable process. 

6.16 Program a sequence of rotations of infixes of the vector x which will 
reverse the order of its components. (See the Ranking by insertion program 
(6.29) for the case when the key defines a complete reversal of order.) 

6.17 Assuming that an item transfer takes c times as long as a comparison of 
two keys, determine a criterion for the choice of m in an m- way internal revision 
merge for the following cases 

(a) assuming 2m comparisons per item (m comparisons for eligibility and m 
for minimization). 

(b) assuming that a ranking sort is used on the m item positions. 

6.18 (a) Let z be a vector key of dimension three whose successive components 

represent department number, job number, and name, respectively. 
Two lists are to be produced, one ordered by name within department 
within job and the other by department within job within name. Deter¬ 
mine a mapping vector p such that y = z p is the vector key of least 
dimension which contains the two required orderings. 

(b) Let y = z p i be a set of vector keys defining a set of related orderings. 
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Determine a vector key of minimum dimension which contains all of 
the related orderings for the case p 1 = (3, 4, 1, 5 ),p 2 = (3, 1, 5, 6), and 
P 3 =(1,4, 3, 5). 

(c) Analyze the effect of related orderings on the expected execution time 
of a merge sort. (Consider the effects of duplicate keys on expected 
string lengths.) 

6.19 Let D be a 0-origin matrix of the form of Table 6.14 which describes the 
cascade sort, i.e., row D j describes the distribution of strings at the completion of 
stage /. Using the special matrices of Sec. 1.13, write concise expressions for the 
matrices F and B such that 

(a) D J_1 = B + & 

(b) = F + D j_1 

(c) Show that F and B are inverse. 

(d) Determine the dominant eigenvalue of B when v(B) = 3, and show its 
relation to the power of the cascade sort for four files, (cf. Sec. 6.1 and 
Carter (1962).) 

6.20 Determine the relative efficiencies of serial column sorting and serial merge 
sorting for the following conditions. Internal sorting, with a maximum of 100 
item positions, is to be used as an aid to each of the processes, and the time for the 
internal sorting is assumed fixed. There are 10,000 items with 4-digit decimal 
keys, and each key value is associated with at most four items. The initial arrange¬ 
ment contains 3500 maximal (increasing) strings. 

6.21 Program an address table sort. 

6.22 (a) The determination of the permutation vector required in the address 

table sort can be considered as a mapping from each item onto its rank 
in the set. Show that for distinct keys this mapping can be performed 
by counting for each item the number of items having a smaller key. 
(b) Program the method of part (a). (This is known as a counting sort.) 

6.23 (a) Program a two-phase internal merge sort. 

(b) Program a two-phase internal column sort. 

6.24 Program an extension of Program 6.33 (/?th-degree selection executed on 
the right list node vector c 2 /]T) to 

(a) sorting with replacement. 

(b) the case of a singular homogeneous tree with dispersion vector v(T). 

(c) cover both cases (a) and (b). 

6.25 If the transfer from a serial file can proceed concurrently with other opera¬ 
tions, it is frequently advantageous to associate two fields of internal storage 
(called buffers ) with each file and to transfer the next group of items to one of the 
fields while executing necessary operations on the items of the other. Buffers may 
be used similarly for output files. 

(a) Program an m- way single-phase merge using two buffers for each of the 2m 
serial files. 

(b) Program a base b single-phase column sort using two buffers for each of the 
2b serial files. 
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6.26 If the number of input buffers serving m files is reduced to m 4- 1 some 
advantage may still be gained by “predicting” the file whose (currently) associ¬ 
ated buffer will next be exhausted, and initiating a transfer from it to the idle 
buffer. Repeat parts (a) and (b) of Exercise 6.25 for (m 4-1) and (b 4- 1) input 
buffers, respectively. (See Friend, 1956.) 

6.27 Repeat parts (a) and (b) of Exercise 6.25 with the number of output buffers 
also reduced as in Exercise 6.26. 

6.28 Since a given initial arrangement may be easier to bring to descending than 
to ascending order on the keys, and since a final reversal of order may be easy to 
achieve (by backward read in the case of serial files or by address modification in 
the case of internal storage), it may be advantageous to choose ascending or 
descending order according to some estimate based on the initial arrangement. 
Write a program which first counts the number of ascending strings and then 
executes a ranking sort by insertion to produce either ascending or descending 
order according to which appears to be the easier to achieve. 

6.29 For the first few values of n, compute and compare the following alterna¬ 
tive evaluations of the expected number of stages in a bubble sort 

n s n—s s J 

(a) n 4- 1 — z(n ), where z(n) = Y -— . 

_ s=i nl 

(b) n 4- 1 — Vtt/z/2. 

(c) c/n\ 9 where c is the total count of all stages required for the n\ possible 
initial arrangements of n distinct keys. 
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THE LOGICAL CALCULUS 


The present chapter develops two fundamental areas of symbolic logic: 
canonical forms and the basic procedures of decomposition. 0-origin 
indexing is used throughout. 


7.1 ELEMENTARY IDENTITIES 


Certain elementary identities will first be summarized for reference. 
The first of them (equation 7.1) merely defines a matrix of operators 
employed in equation 7.4. 


De Morgan’s laws 



¥=l x = 2 | +/x 

H x = ((* a y) 7*= (* a y)) 
^ix = ((xty)^(xty)) 


Z = (a a c) A (q X f) = (a A q) % (c A r) 
v/v//(*£y) = (v/#) A (v/y) 

#/^//(* A y) = (^/*) A (^/y) 


(7.1) 

(7.2) 

(7.3) 

(7.4) 

(7.5) 

(7.6) 

(7.7) 

(7.8) 

(7.9) 
(7.10) 


Identities 7.2, 7.3, and 7.5 may be established by induction on the 
dimension of x. Equation 7.4 summarizes the sixteen identities obtained 
by extending equations 7.2 and 7.3 to arrays. For example, if ®/* and 
® k j are the operators ^ and A, respectively, then equation 7.4 becomes 

X t y = FT?. 

246 
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The foregoing relation may be verified as follows: 

(x t iv = #/(** a y,) 

= =/(X i A y ; .) (by equation 7.3) 

= =/(X i V y,) (by equation 7.2) 

= (* v *0/. 

Equation 7.7 is a direct extension of equation 7.6, which is itself derived 
as follows. Since the operator ^ is associative and commutative, then 

= ((#/(y/#» ^ (#/(>/*))• 

Moreover, 

^/y/x = 2 |(+/5V*) = 2 |(+/(* A y)) = #/(* A y), 

the first and second and the third and fourth limbs being related by 
equation 7.5. Consequently, 

#/* = ((#/(* A 3 s )) # (#/(* A y))) 

= ((* a y) # (* a y))- 

The following argument establishes equation 7.8. By definition, 

Z/ = (a i A c,) A (<L A r,) 

= 0* A <?,) A (c, A r,) 

= (* A <?), A (c A r),. 

Consequently, Z = (a A q) ° (c A r). 

Equation 7.9 is obtained by noting that if M = x % y, then M j = 
# A y,c. Then 

(v//M), = v//M, = (v/*) Ay„ 
and (v//M) = (v/*)€ A y. 

Finally, v/v//M = ( v/*) A (v/y). 

The derivation of equation 7.10 is similar. 

7.2 CANONICAL FORMS 
Intrinsic vector 

Any function defined on a finite domain can be specified by listing each 
possible value of the argument together with the corresponding function 
value. For a logical function of n variables, the n arguments may be 
considered as the components of a logical vector x of dimension n, and the 
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domain of the function is then represented by the (rows of the) matrix 
T(n) of dimension 2 n x n defined as follows: 

A_T k = k. 

For n = 3, for example, T has the form shown in Table 7.1. 


T 

Kf) 

p 

= 

T 

A 

X 

0 

0 

0 

1 

x 0 

A 

*1 

A 

x 2 

0 

0 

1 

0 

*0 

A 


A 

X 2 

0 

1 

0 

1 

x 0 

A 

*1 

A 

*2 

0 

1 

1 

0 

*0 

A 

*1 

A 

X 2 

1 

0 

0 

1 

*0 

A 


A 

X 2 

1 

0 

1 

0 

*0 

A 

*1 

A 

x 2 

1 

1 

0 

0 

*0 

A 

*1 

A 

x 2 

1 

1 

1 

0 

*0 

A 

*1 

A 

X 2 


/(*> = *(/) i p 

= (*0 a *! A x 2 ) V (*„ A *! A * 2 ) V (* 0 A*,A x 2 ) 

Table 7.1 The disjunctive canonical form 

A logical function / can therefore be specified by its intrinsic vector i(f) 
defined by: 

w) =/(n- 

Table 7.1 shows /(/) for the function 

f{x) = (# 0 A x 1 A #2) V (#o A x 1 A x 2 ) V (^0 A #1 A #2) 

= (*o V ^1) A ^ 2 - 

Applying the usual notation for operations on variables to operations 
on functions as well (e.g., / denotes the function inverse to /, and/ A g 
denotes the conjunction of the functions / and g) permits the expression 
of certain easily derived identities concerning intrinsic vectors: 

*(/) = i(f) 

Kf v g) = Kf ) v i(g). 

More generally, the intrinsic vector of any function of functions is the 
same function of their intrinsic vectors. 

The two trivial functions identically one and identically zero will be 
called, respectively, the unit function and zero function, and will be denoted 
by 1 and 0. Thus /(l) = €, and i(0) = €. 
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Characteristic vectors 

A vector which represents a function/is called a characteristic vector of 
/. The intrinsic vector i(f) is but one of several useful characteristic 
vectors. 

The expression y t x denotes the function obtained by first negating 
each x i for which y i = 0 and then taking the conjunction (that is, and 
function) over the resulting vector. Such a function is called a minterm in 
x , and the components of the minterm vector 

p = T ^ x 

comprise all possible minterms in x with no repetitions. 

The component p t is a function of x which assumes the value one if and 
only if x = T i . Consequently, for any function f 

m = v imp) 

= V/(*(/) A p) 

= Kf) A p. 


This relation is illustrated by Table 7.1. 

The expression 

m = Kf) aP = *•(/) a (Ttx) 

is called the disjunctive canonical form of the function / since it displays / as 
a (unique) disjunctive (that is, or) function of minterms, each of which 
involves all variables and conjunction and negation only. The disjunctive 
is one of several canonical forms of interest, each of which assumes the 
form 

/(*) = Y (f)%\ s(x), 

where the characteristic vector y (f) is a function of/ only, and th e specific 
vector s(x) is a function of the argument x only. Each of the four forms of 
interest is characterized by the particular over-all operator Ox occurring 
in it and is therefore called the disjunctive , conjunctive , exclusive disjunctive , 
or equivalence canonical form* according to whether O x is the operator 
V, A, or =. 

The characteristic vector and the specific vector appropriate to each form 
will be characterized by its over-all operator o v Thus 

m = Y if, O x )%\s{x, Oj). 

* The functions x V y, x A y, (x ^ y), and (x = y ) are, except for the trivial functions, 
the only associative commutative functions of two variables. 
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The forms are defined formally by the following expressions.* 

Y(/> v ) a (T # x) (Disjunctive) (7.11a) 

_ . Y(/, A ) v ( T = *) (Conjunctive) (7.11b) 

Y(/> 7 ^) a (T v *) (Exclusive disjunctive) (7.11c) 

- Y(/. =) v (T a *) (Equivalence) (7.lid) 

Table 7.2 shows the intrinsic vector of the function / of Table 7.1 
together with corresponding pairs of characteristic and specific vectors. 
These may be substituted in equations 7.11 (a-d) to verify that they do 
represent the function /. 

Since x = T k = T k for some k , equation 7.11a may be written as 

4(/) =/(T*) = Y (/> v)X(r; ?*). 

Consequently, 

[ Y(/> v) X (T £ T) = Y(/> V) X 5 ( V) (7.12a) 

Y(/, A) 0 (T V T) = y(/> A) 0 5(A) (7.12b) 

* J I Y(/> =*)t (T 0 T) = y(/, #) X 5(^) (7.12c) 

- Y(/> =) v (T a ?) - y(/, =) 7 5(=) (7.12d) 

Each of the matrices 5(0) appearing in the right limbs of equations 
7.12(a-d) is a fixed function of T and is called the O-specific (e.g., dis¬ 
junctive specific) matrix. Since i(f) is a function of y(/, o) and 5 (o), the 
relation between the intrinsic vector and each characteristic vector is 
determined by the corresponding specific matrix. 

Since 5 (v) = (T £ T) = (T * T), it is clear that 5 (v) = /. Conse¬ 
quently, y(/, v) \ 5 (v) = y(/, v), and therefore, 

Y(/> V) = <(/). (7.13) 

Similarly, 5(a) = /”, and y(/, A) 0 5(a) = y(/, A), and again 

Y(/> A) = /(/). (7.14) 


An explicit expression for 5(^) may be obtained by induction on the 
dimension of the corresponding argument, and, to facilitate this, the 
notation T(ri) and 5(^, n) will be used for the matrices appropriate to an 
argument x of dimension n. T(n + 1) may be written in partitioned form 
as 


T(n + 1) = 


T(n) 
T(n )J 


* The expression T ^ x used here for the specific vector in the disjunctive form is 
equivalent to the expression T used earlier. Its use increases the uniformity of the 
expressions for the canonical forms. 
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n + 1) = I ~"K 

= £ A S(n) 

E A S(#, n) 


E A S(^, n) 
E A S(j£, n). 


since e$e = e$e = E, and c § c = E. Finally, 


n + 1) = 
R 11 


Since S(1 ) = 


+ I> -L'. T~ 

it is clear that 


S(^,n) 
S&, n). 


S&, 2 ) = 






"l 

1 

1 

1 

1 

1 

1 

r 





0 

1 

0 

1 

0 

1 

0 

1 

~1 

1 

1 

n 

0 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

1 

J , *S’(#, 3) = 

0 

0 

0 

0 

1 

1 

1 

1 

_0 

0 

0 

lj 

0 

0 

0 

0 

0 

1 

0 

1 





0 

0 

0 

0 

0 

0 

1 

1 





_0 

0 

0 

0 

0 

0 

0 

1_ 


and so forth.* 

The following useful properties of are easily verified for the fore¬ 
going examples and may be established generally by formal induction. 
The matrix is self-inverse with respect to the operations (^), that is, 

s&) A S&) = I (7.16) 

Moreover, since every row of the transpose S(^) save the zeroth has an 
even number of ones, 

(2c) |+/£(*) = «?. 

Hence by equation 7.5, 

*ISfr) = €°. (7.17) 

Since (by equation 7.4) S(=) = S (^) and since S(^) is of even dimension, 
the same result holds for S(=). 

* This result was first obtained by Muller (1954), who employed the matrix C of 
binomial coefficients and showed that S(^) = (2E)| 0 C. Also see Calingaert (1960). 
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Equation 7.12c gives i(f) as a function of y (/, tO andS(T^). This relation 
is more commonly written in transposed form and with iff) replaced by the 
equivalent y(/, v) as follows: 

Y(/> v ) = S( z £)t Y(/> #)• (7.18) 

Since S(t^) is self-inverse (equation 7.16), premultiplication of equation 
7.18 by Sffi) f yields an identical expression for y(/, f=) as a function of 
Y(/, v), namely 

YC/, *) = £(#) aY(/.V). (7.19) 


The characteristic vectors of the identity function 1 and of the zero 
function 0 may now be derived. Clearly i( 1) = € and i(0) =€ . Hence by 
equations 7.13 and 7.14, 

Y0> V) = y(1> A) = e, 

and 

Y(°> v) = y(0. A) = I. 

Moreover, 

Y(l> #) = $(#) A « 

= #/£(*). 


Hence, by equation 7.17, 

Y(l, ¥} = e°. (7.20) 

Similarly, y(0, =£) = i. 


The relations between the characteristic vectors of a function f and of its 
inverse/ may now be obtained. Since 


Kf) = 1(f), 


(7.21) 


then, by equations 7.13 and 7.14, 

Y(/7 v) = Y(/> v), 


and Y(/> A) = y(/> A). 

Moreover, Y (/~ #) = $(#) a «'(/) = $(*) a *(/), 
by equations 7.13, 7.19, and 7.21. Hence 

yC/7 = (y(/, ^ c 0 ) (7.22) 

by equations 7.7, 7.17, 7.19, and 7.13. Characteristic vectors of a function 
and of its inverse are displayed in Table 7.2. 

The relation between y(/, 7 ^) and y(/, =) may now be obtained by 
applying equation 7.22 to equation 7.12c to yield 

1(f) = Kf) = (Y if, *) f 6°) t S&). 



s( *) i Y(/, *) S( =) = y(/ 5 =) i (/) ■§( =) 7 Y(/, #) 5( *) t w. 
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Table 7.3 Relations among characteristic vectors 
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Application of De Morgan’s law and of the fact that S(=) = S(=£) yields 


*•(/) = (Y if, fi) # €°) V *(=)• 

Comparison with equation 7.12d shows that 

Y if, =) = (Y 

= (Y(/, fi) = €°). 

The relations among the various characteristic vectors are summarized 
in Table 7.3. 


7.3 DECOMPOSITION 


A logical function f(x ) is said to be decomposable on u if it can be written 

in the form „ x nr-t \ i \ 

fix) = g(h(u!x), ujx). 


where g and h are logical functions. Since /, g, and h are functions of 
v(u), (1 + +/u), and +/u variables, respectively, then if +/u > 1, both 
g and h are functions of fewer variables than/. Decomposition on u such 
that +/u > 1 thus permits / to be expressed in terms of simpler functions 
g and h and provides an important basis for simplification techniques. 

Every function is decomposable on u = e. Moreover, if +/u = 1, then 
u = €* for some i, and 


fix) = [x ( A f{x A €0] V K A fix V €*)]. 

Since both f{x A I*) and/(# v € 0 are expressible as functions of €*/#, then 
the foregoing expression is of the required form with h(u/x ) = Con¬ 
sequently, all functions are trivially decomposable for +/u = 0 or 
+/w= 1 . 


Disjunctive canonical form 

Ashenhurst (1957) determines nontrivial decompositions of /(#) by 
arraying the intrinsic vector i(f) in a 2 +/u X 2 +/u matrix F defined as 


follows: 

F/ = h(f), 

where 

k = (2e) _L k 


i = (2e) JL (u/fe) 

and 

j = (2€) J_ («/*). 


If, for example, u = (1,0,1,1,0), then the index k of each component i k (f) 
appearing in F is given by the matrix* C of Table 7.4. The table also shows 

* Ashenhurst (1957) calls the matrix C a decomposition chart and represents F by 
circling each element of C which corresponds to a nonzero component of i (/). 




Table 7.4 Decomposition of the function f(x) on u 
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Let p = T £ x, 

q=T* <u/x) 9 
and r = T £ (u/x) 

be the minterm vectors of x, u/x, and u/x, respectively.* Then the matrix 

P = q° A r 

contains the components of p arrayed in the same order as the components 
of i(f) in the matrix F. Consequently,! 

/(*) = <‘(/)Ip = V/V//(FAP). 

Decomposability depends on the structure of the matrix F. If each 
column of F is either zero or equal to the vector a, there exists a vector c 


such that 

F = a ° c. (7.23) 

Hence, f(x) = \j/\jj/M, 

where M — (a % c) A (q % r). (7.24) 

But by equation 7.8, M = (a A q) % (c A r), (7.25) 

and hence by equation 7.9, 

/(*) = V/V//M = (V/(a A q)) A (V/(c A r)) (7.26) 

= (« a 9 ) A (c )( r). 


Since the first and last terms on the right of equation 7.26 are, respec¬ 
tively, functions of u/x and of w/# only, the function f(x) is decomposable. 
The required functions are simply 

h = h(u/x ) = #A? = «I[r^ (u/x)'] 

f(x) = g(h, u/x) = h A [c X r] • (7.27) 

' = T $ (u/x) 

Since equation 7.27 does not represent the most general possible function 
of h and u/x, it appears that the characteristic matrix F = a X c does not 

* Although denoted by the same symbol, the matrices T are of differing dimensions 
as required by compatibility. 

t Since i(f) is equal to y(/> V )> it may be substituted for it in the disjunctive canonical 
form. 
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represent the most general function decomposable on u. Ashenhurst 
(1957) has shown that the most general type of function decomposable on 
u is represented by a characteristic matrix of the form* 

F = (a° A b)V(a° A c)V (e A ° d) V (c X *), (7.28) 

where b , c, d , and e are mutually disjoint and collectively exhaustive, that 
is, 

b + c + d + e = e. 

The fourth term of equation 7.28 is identically zero and is included only 
for formal completeness; hence 

F = (5 £ b)V (a ° A c)V(e° A d). (7.23') 

Equation 7.24 now becomes 

M = ((a ° A b) V (a ° A c) V (c X d)) A (q ° A r), (7.24') 

and since conjunction is distributive over disjunction, equation 7.8 may 
again be applied to yield 

M = ((a A q) % (b A r)) V ((a A q) A ° (c A r)) V ((e A q) % (d A r)). 

(7.25') 

Moreover, since v/v//(AT v F) = (v/v//^) V (v/v//I0, equation 7.9 
may again be applied to yield 

/(*) = V/V//M 

= ((« a 9 ) A (b )( r)) V ((a \ q) A ( c )( r)) V q) A (d\ r)). (7.26') 

Since q is a specific vector of the disjunctive canonical form (i.e., a 
minterm vector), it is some column of the specific matrix S(v). Since 
S(V) = /, q therefore contains exactly one nonzero component, and 
consequently (a v g) = (a \ q), and € )( q = 1. Equation 7.26 can thus 
be rewritten in decomposed form as 

h = h{ulx) = alq = a\(T £(5/#)) 

/(*) = g(fc, u/x ) = (h A (b X r)) V (A A (c X r)) V (d X r) ^ (7.27') 
»• = T % (i u/x ) 

It is interesting to note that no use has been made of the fact that 5, c, 
and d are disjoint. Relaxation of this restriction does not, however, 
increase the generality of the matrix F, since ave = €, ave = a, and 
a v a = €. It does suggest, however, that the matrix F may be expressed 
more compactly as 

F = {a ° A m) V {a ° A n), 

* In Ashenhurst’s terminology, the matrix F must be of the following form: each 
column is either all zeros, all ones, the vector «, or the vector a. 
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where m = b v d and n = c v d. The second line of equation 1.21' 
then becomes 

/(*) = Ch A (m l r )) V (A A (n X r)), (7.28) 

a reflection of the obvious fact that the third term (d \ r) of equation 1.27 
can be incorporated in the preceding terms. 

Table 7.4 shows a complete example of the decomposition process for 
u = (1, 0, 1, 1, 0). The characteristic matrix F is obtained by applying 
the matrix C to *'(/)• Clearly 

F = (a % b) V (a a c) V (c % c) 

= (a % m) V (a X n). 

Consequently, 

f(x) = (h A (b X r)) v (A A (c X >•)) V (d X r X 
where h — a \ q, q = T * (a/*), and r = T ^ (u/'tf). 

Since Q = (#^ a #4, #1 A ^4, ^ A #4, A #4), 

and a = (1, 0, 0, 1), 

then h = (x 1 A i 4 ) V (x 1 A * 4 ) = (x x = * 4 ). 

Similarly, )( r = (* 0 A x 2 A * 3 ) V (* 0 A x 2 A * 3 ) 

= x 2 A (*o ^ *s)» 

c a r — (&o ^ ^3) V (^o A x 2 A # 3 ), 
and d r = x 0 A x 2 A x 3 . 

Alternatively, the use of the vectors m and n yields the solution 

f{x) = {/z A [(#0 A #2 A #3) V (*o A *3)]} V {h A [(# 0 A #3) 

V (#0 A {x 2 = #3))]}. 

The entire decomposition process is described by Program 7.5. Steps 
1-7 determine the characteristic matrix F appropriate to the decomposition 
u. The loop 2-7 is repeated for each value of k from zero to 2 V{U) — 1. 
Step 2 determines k as the vector (of dimension v(u)) whose base two 
value is k. Steps 3 and 4 then specify the indices / and j appropriate to k, 
and step 5 specifies element F/. 

Step 11 determines d as the vector which specifies all full column vectors 
of F, that is, d j = 1 if and only if F j = e. Step 12 determines e as the 
corresponding vector specifying the zero columns. 

If d and e together exhaust the columns (that is, d v e = e), then b and 
c (and a arbitrarily) must be set to zero. Since this is done by steps 8-10, 
the exit branch on equality at step 13 terminates the process correctly. If 
(d v e) €, then any column of the matrix {d y e) j F can be used to 
specify a; step 14 uses the first column. Step 15 determines b as the vector 
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16 

c <- a ^ F 
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€ : (ft V c V d V «) 





Program 7.5 Decomposition of/on u 


specifying the columns of F which are equal to a, and step 16 determines c 
correspondingly for the vector a. 

The function/is decomposable on u if and only if b, c , d, and e together 
exhaust all columns of F. The left-pointing exit on step 17 therefore indi¬ 
cates nondecomposability on u. 

The algorithm can be extended to test all possible values of u successively 
and so determine all possible decompositions. Use can be made of the 
obvious fact that the matrix F appropriate to decomposition on u is the 
transpose of the matrix F appropriate to u. 

Other canonical forms 

Ashenhurst (1957) remarks that decomposability is an intrinsic property 
of a logical function/and is independent of the form of its representation. 
It can also be shown that the particular algorithm of Program 7.5 is intrinsic 
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in that it applies (in a slightly generalized form) to the characteristic 
vectors of all four canonical forms. 

Since /(/) = y(/, A) = y (/, v), it is clear that the decomposition 
algorithm applies directly to the disjunctive and conjunctive characteristic 
vectors. The case for the remaining forms will be developed for the 
exclusive disjunctive form only. 

Let F be defined as in Program 7.5 but with i( f ) replaced by y(/, ¥=)• 

Moreover, let - A 

p = T () x 

q=Tj C u/x ) 

r = T 0 (u/x), 

and p = q ° r . 

Then, clearly f(x) = ^/^//(F A F). 

As in the case of the intrinsic vector, the structure of F must be expressed 
in terms of the characteristic vectors of a given function h , of its inverse h , 
and of the identity and zero functions. In the exclusive disjunctive form, 
Y(l> ¥0 = €°, y(0, ^) = €, and if y(/z, =£) = a , then y(/i, = (a =£ €°). 

The term in € may again be disregarded and the form required of F for 
decomposability may (following equation 7.23') be written as 

F = [(a ^ €°) A ° fc]V(a;c)V (€° ^ d), (7.23") 

where b, c, and d are mutually disjoint. 

The matrix M such that f(x) = may now (as in the analogous 

case of equation 1.25') be obtained by applying equation 7.8: 

M = (((« ¥= €°) A q)%(b A r)) V ((a A q) % (c A r )) 

V ((€° A q) % (d A r», (7.25") 

Equation 7.25" will also be written as 

M = U v F V IF, 

where (7, F, and IF denote the successive matrices of the right limb. 

Since b, c, and d are disjoint, so also are the matrices U, V, and W. For 
any pair of disjoint matrices X and Y, it is easily shown that 

V Y) = (&W/X) * &l*HY)). 

Hence 

/(*) = = {[(^//i/) * &WIV)] * &MW)}. 

The application of equation 7.10 to each of the matrices U, V, and W now 
yields 

f(x) — {[(((a =£ €°) A q) a (b t r)) ^ ((a * q) A (c t r))] 

± ((«° tq) A (dt r))}. (7.26") 



262 The logical calculus 


§7.3 


Since a, (a ^ €°), and €° are the characteristic vectors of the functions 
h, h , and 1, respectively, equation 7.26" may be written in the decomposed 
form 


h(u/x) = a t (T 0 (u/x)) 1 

/(*) = d(h A (f> a >•)) ^((|A (c A 0)) =£ (d i r)) - 
r = T 0 (u/#) 


(7.27") 


For the example of Table 7.4, 


Y(/, = (1, 1, 1, 1,0, 0, 0, 1, 1,0, 1,0, 0, 0, 1,0, 0, 1,0, 1, 1, 1,0, 1, 1,0, 1,0, 1,0, 1,0). 

Table 7.6 show its decomposition in the exclusive disjunctive form. 


1 1 0 0 0 0 1 0\ 

110 111111 
110111111 
0000000 0 / 

b = (0 0 0 1 1 1 0 1) 

c = (1 1 0 0 0 0 1 0) 

d = (0 000000 0) 


Table 7.6 Decomposition in exclusive disjunctive form 

The only change required in Program 7.5 is in the specification of the 
vectors d and b on steps 11 and 15. These may be replaced by the following 
steps: 

11 d <-e° = F 
15 b*-(a^e°) * F. 

The program may be made completely general (i.e., for y(/, o)) by simply 
replacing €° withy(l, O) in the foregoing steps, and replacing € with 
y(0, O) in steps 8, 9, 10, and 12. 
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EXERCISES 

7.1 Use De Morgan’s laws (equations 7.2 and 7.3) to establish 

(a) the identity U () V = U\V. 

(b) the identity U * V = U v V. 

(c) equation 7.4. 

7.2 Show that the expressions x = (y ^ z) and y = (x z) are equivalent. 

7.3 For the function /(* 0 , x l9 x 2 ) = (* 0 A (*i A * 2 )) V (* 0 A x 2 ), derive 

(a) the intrinsic vector /(/). 

(b) the four characteristic vectors of /. 

(c) the intrinsic vector of /. 

(d) the four characteristic vectors of /. 

7.4 Use the matrix ® of equation 7.1 to summarize the canonical form expres¬ 
sions (equations 7.11 (a-d)) in a single equation. 

7.5 Show that y(/, A ) = Y(/, A ). 

7.6 Use De Morgan’s laws and the result of Exercise 7.5 to derive equation 
7.11b from 7.11a. 

7.7 Show that if y A z = 0, then 

*t(y v*) =((*iy) 

7.8 Let f and g be two disjoint functions (i.e.,/(*) A g(x) = 0 for all x ), and 
let /r =f \/ g. Derive expressions for the four characteristic vectors of h in terms 
of the four characteristic vectors of / and g. 

7.9 Each of the sixteen logical functions of two variables may be characterized 
by its intrinsic vector i(f) and be denoted by 

y ) = P( x > K A y)- 

For example, (x A y) = P(x, (0, 1, 1, 0), y). (The function p(x, i( f), y) is some¬ 
times called the A;th Boolean function and denoted by p k (x , y ), where k = -L »•(/).) 
This notation can be extended to vectors x and y so as to permit different func¬ 
tions to be specified for each component. Thus 

z <- P(x, F, y)<Z>z t = fS(x h F h y ( ), 

where v(F) = r(x) and fi(F) = 4. Show that 
(a) if * = (0, 0, 0, 0, 0, 0, 1, 1), y = (0,1,0, 1, 0,1, 0, 1), 
and JL F = (0, 0, 0,0, 6, 6, 6, 6), then /?(*, F, y) 

= ( 0 , 0 , 0 , 0 , 0 , 1 , 1 , 0 ). 
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(b) P(x,F,y) = K*,F,y) 

(c) p(x, F v G,y) = p(x, F,y) v P(x, G,y) 

(d) P(x, F A G,y) = p(x, F,y) A P(x, G,y). 

7.10 The functions P(x, F, y) defined in Exercise 7.9 can alternatively be ex¬ 
pressed as P(x 9 f,y), where / =i!F. Develop relations on / corresponding to 
those of Exercises 7.9 (b-d). 
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S.l CONVENTIONS 
Basic conventions 

(a) 1-origin indexing assumed in this summary. 

(b) Controlling variables appear to the left, e.g., u/x, b _L y, k \ x, and 
u r x. 

(c) Dimension n may be elided (if determined by compatibility) from 
€(«), e k (n), a k (n) 9 io k (n), and i j (n). 

(d) The parameter j may be elided from operators \ j9 0 j9 and i j9 and 
from the vector i j if j is the index origin in use. 

(e) The parameter k may be elided from k f x if k = 1. 

Branching conventions 



The statement to which the arrow leads is executed next if ( x0ty ) 
= 1; otherwise the listed successor is executed next. An unlabeled 
arrow is always followed. 

(b) x :y, r->s 

The statement numbered is executed next if ( xr i y) = 1 . The null 
symbol ° occurring as a component of r denotes the relation which 
complements the disjunction of the remaining relations in r. 

(c) —► Program a , b 

Program a branches to its statement b. The symbol a may be elided 
if the statement occurs in Program a itself. 

Operand conventions used in summary 



Scalar 

Vector 

Matrix 

Tree 

Logical 

u , v , w 

U, V, tv 

U, V , w 

U, V, w 

Integral 

h, i, j, k 

ft, i, j , k 

H, /, J, K 

H, I, J, K 

Numerical 

v, y , ^ 

*,y, z 

x , y, z 

X, Y, Z 

Arbitrary 

a, b , c 

a , b, c 

a, B, C 

A, B, C 
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S.2 STRUCTURAL PARAMETERS, NULL 


Dimension 

v(a) 

Number of components in vector a 

§1.5 

Row dimension 

v(A) 

Number of components in each row 
vector A* 


Column dimension 

/((A) 

Number of components in each 
column vector Aj 


Height 

’’(A) 

Length of longest path in A 

§1.23 

Moment 


Number of nodes in A 


Dispersion vector 

v(A) 

v x (A) = number of roots of A; 
v/A) = maximum degree of nodes 
on level j — 1; Kv(A)) = v(A) 


Moment vector 

P(A) 

ty(A) = number of nodes on level j 
of A; v(n(A)) = v(A) 


Degree of node i 

<5(i, A) 

Degree of node i of tree A 


Degree 

•5(A) 

(5(A) = max S(i, A) 

i 


Leaf count 

A(A) 

A(A) is the number of leaves in A 


Row dimension 
of file 

v(d>) 

Number of files in each row of a 
file array 

§1.22 

Column dimension 
of file 

K ®) 

Number of files in each column of a 
file array 


Null character 

o 

Null character of a set (e.g., space in 
the alphabet) or null reduction 
operator 

§1.3 
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S.3 RELATIONS 


Equality 

a = b 

a and b are identical §1. 

Membership 

a e b 

a = b t for some i 

Inclusion 

ni ui 

.O Q 

a j e b for all j 

Strict inclusion 

b 3 a 

a cz b 

b 3 a and a b 

Similarity 

b = a 

b z a and a z b 

Complementary 

$ 

The relation which holds if and only 

relations 


if does not. Examples of comple¬ 
mentary pairs: e, 4\ ^ ; >, >. 

Combined (ore d) 


A list of relations between two vari¬ 

relations 


ables is construed as the or of the 
relations. Thus x c= z> y is equivalent 
to x =£ y. When equality occurs as 
one of the ore d relations, it is indi¬ 
cated by a single inferior line, e.g., 

< and c=. 

S.4 ELEMENTARY 

OPERATIONS 


Negation 

w <- u 

w = 1 ou = 0 § 

And 

w *«- u A v 

w = 1 <=> u = 1 and v = 1 

Or 

w u Vi' 

w = 1 <=> u = 1 or v = 1 

Relational state¬ 
ment 

w <-(a & b) 

w = 1 <=> the relation a @ b holds 

Sum 

z*-x 4- y 

z is the algebraic sum of x and y 

Difference 

z *-x — y 

z is the algebraic difference of x and y 

Product 

z <-x x y 

z *-xy 

c <-a x u 

c •*— au 

z is the algebraic product of numbers 
x and y, and c is the arbitrary character 
a or zero according to whether the 
logical variable u is one or zero. 

Quotient 

z <-x -7- y 

z is the quotient of x and y 

Absolute value 

2 H*l 

z = x x [(a? > 0) — (x < 0)] 

Floor 

k -HxJ 

k < x < k + 1 

Ceiling 

k*- \x\ 

k > x > k — 1 

y-Residue mod h 

k *-h |,- / 

i = hq + k; j < k <j + h; and q is 
integral. 



268 Summary of notation 


S.5 VECTOR OPERATIONS 


Component-by- 

c +-a O b 

c t = a t o b t . Examples: x x y, §1.5 

component exten- 


(a ^ b ), h | ; - i, u A F, f*l. 

sion of basic 
operation 

Scalar multiple 

z *-x x y 

z <-xy 

c <-a x u 

c <-au 

z L = x x y u and c { = a x u t 

Reduction 

<- O/a 

c =(■■■ ((Oi O a 2 ) o o 3 ) • • •) © a,), §1.8 
where O is a binary operator or rela¬ 
tion with a suitable domain. Examples: 
+/*, x/#, ^/u. Reduction of the 
null vector €(0) is defined as the identity 
element of the operator O. Examples: 
+/e(0) =0; x/c(0) = 1, V/€( 0)=0, 
A/€( 0 ) =1. 

Ranking 


§1.16 

y-origin b-index 

c <-b lj a 

c = o if a 4 b ; otherwise c is the /- 

of a 


origin index of the first occurrence of 
a in b. 

y-origin b-index 

c <- b ij a 

c t = b ij a t 

of a 

Left rotation 

c <-k \ a 

ci = a h where j = v(a ) | x (/ + &) §1.6 

Right rotation 

c <-k[a 

C; = «/, where / = v(a) | x (/ - k) 

Basejy value of x 

z <-y -1 x 

z = +I(P x *), where p v = l,and §1.14 
Pi-1 = Pi x J>i 

Compression 

c *- u\b 

c is obtained from a by suppressing §1.9 
each b i for which = 0 

Expansion 

c <-u\b 

u/c = 0 , u/c = b 

Mask 

c <-la,u,bl 

u/c = u/a, u/c = u/b 

Mesh 

c +-\a, u, b\ 

u/c = a, u/c = b 

Catenation 

c *-a 0 b 

C (^j, #2> * * • ^r(o)’ ^1» • • • ^r(b)) 


\a, co'( fr >, b\ 
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S.5 VECTOR OPERATIONS (continued) 


Characteristic of 
# onjy 

tv < £y x 

yth unit vector 

tv e J (h) 

Full vector 

tv e(h) 

Zero vector 

w e(h) 
tv <-0 

Prefix of weight j 

tV <- CL j (h) 

Suffix of weight j 


Maximum prefix 

tv «*- a/u 

Maximum suffix 

w <- wju 

Forward set 
selector 

tv +- a\a 

Backward set 
selector 

tv «<— r/a 

Maxima selector 

xv <- u\x 

Minima selector 

xv <- u[x 

Interval or 

k i \h) 

/•origin 
identity permu¬ 
tation vector 


/origin permu¬ 
tation vector 

k 

/origin mapping 

c +-a h 
c - bj,a 

/origin ordering 

k <- Oj/x 


w i = (:Vi € *); K w ) = Hy) §1.15 


= 0 =j) 

U>i = 1 

u>i = 0 


§1.7 


L v(xv) = h 


First k of tv i are unity 
where k = min (/ h). 

Last k of w { are unity 
where k = min (/ h ). . 
xv is the max length prefix in u. 
Example: 

a/(l,l,0,1,0, 1) =(1,1,0,0,0,0). 


§ 1.10 


tv is the max length suffix in u. 
Example: 

eo/(i, 1,0,1,0,1) =(0,0,0,0, 0,1). 


= 1 if aj ^ Oi for all j < i 


Wt = 1 if dj ^ a i for all j > / 

tv i = u t A (*i = m) where §1.18 

m = max (ujx) j 

j 

tv { = A (*i = m) where 
m = min («/*)? 

k=dj + + h-\) §1.7 


fe = i \v(h)) §1.17 

c t = o if b t 4 i i (v(a )); otherwise 
c i — a b in a/-origin system for a. In 
the first form (that is, c +- a b ), the 
origin cannot be specified directly. 
y = k\,jX is in ascending order and 
original relative ordering is main¬ 
tained among equal components, that 
is, either y { <y i+ 1 ovy i = y i+l and 
k t < *s #+1 . 
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S.6a ROW GENERALIZATIONS OF VECTOR 


OPERATIONS 



Z *-X o Y 

z> = x/ © y/ 

§1.5 

z - OjX 

z* = Q/X* 

§1.8 

C ®/A 

c = Aj ® a 2 ® • • • 

§1.15 


• • • ® A, 


M *- B i h A 

M* = B* („ A' 

§1.16 

C ■<— k f A 

C* =ki f A* 

§1.6 

C -<- ft J A 

aT 

II 


z *-Y LX 

s* = Y i ±X i 

§1.14 


C-A b 

Cj = A „, 

§1.17 

C ■*- B A 

C ! ’ = B i J A A* 


K*~0JX 

K‘ = OJX* 



C +-A © B 

C* = A* © B* 

§1.9 

C <-u\B 

C i = u/B* 


c <- U/B 

c = [/i/B 1 © • ■ • 



• • • © [7"/B" 


C <- u\B 

a/C = 0, u/C = B 


C <- U\b 

U/C = 0, U/C = b 


C <- \A, u, B\ 

u/C = A, u/C = B 


C - \fl, U, b\ 

U/C = a, U/C = 6 


C<-/A , u, BI 

u/C = u/A, u/C = u/B 


c<-ia 9 u,bi 

U/C = U/A, U/C = U/B 


C <- /a, 17, b/ 

C = /E\a, C7, E\6/ 


W a-ol/U 

W* = a/17* 

§1.10 

W +-a)/u 

IT* = c o/U* 


W <-o/U 

W* = a/17* 


W <-t/U 

jy* = r/i7* 


W <- l/fX 

IF* = [7‘jX* 

§1.18 

W 7 l/[X 

IF* = 17'j.y* 
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S.6b COLUMN GENERALIZATIONS OF VECTOR 


OPERATIONS 



Z *-X o Y 

Z/ = X/ o v/ 

§1.5 

z O //X 

Zj = O /Xj 

§1.8 

C *- ®//A 

C = A 1 ® A 2 ® • • • 

§1.15 


• ■ ■ ®A>‘ 


M *- B u h A 

— Bj ift Aj 

§1.16 

C+-k{\A 

Cj = | Aj 

§1.6 

C +-k\}A 

Cj = kj | Aj 


z <-yjlx 

Zj = Yj 1 Xj 

§1.14 

C <-A b 

.cT 

II 

'6 

§1.17 

c *-B JJ A A 

Cj- — By J/1 Aj 


K - Ml* 

Kj = w 



C <- A ®©B 

Cj = Aj ® Bj 

§1.9 

C <- u//B 

Cj = a/Bj 


c +- U//B 

c = Cj/Bj © • • • 

• • • © UJB V 


C *- u\\B 

«//C = 0, «//C = B 


C *- U\\b 

Ul/C = 0, C//C = 6 


C *- \\A, u, B\\ 

a//C = A, u//C = B 


C - \\a, U, b\\ 

C//C = fl, U//C = 6 


C - I/A, u, B\\ 

a//C = u//A, m//C = a//B 


C - IIA, U, B/l 

Ul/C = C//A, C//C = 17//B 


C *- lla, U, b/l 

C = /E\\o, C, £\\fe/ 



PE *-<x//C 

lEj = a/U, 

§ 1.10 

W — (ol/U 

VEj = co/l/j 


W *-ol/U 

lEj = a/Uj 


W *- t //17 

Wj = r/Uj 

§ 1.18 

it crrx 

IEj = CjfXj 

IE CHX 

IEj = CjlXj 
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S.7 SPECIAL MATRICES 


Full matrix 
Zero matrix 

Superdiagonal 
Identity 
Upper left 
(triangle) 
Upper right 
Lower left 
Lower right 


W<-E(p,q) 

W+-1(p, q ) 

W+- 0 
W +- k l(p , q) 
W<-l(p,q) 

W<-D(p,q) 

W+-0 (p,q) 

W<-U(p>q) 

W*~U(p,q) 


Wf = 1 
Wf = 0 


Wf = (i + k=j) 

W = l°(p, q) 

Wf 

- = (/ 4 -j< w),| 

W ?+ 1 “* m = min (p, q)\ 

yy v+1 -j J 


p{W)=p 
v(W ) = q, for 
p and q 
integers. 

' Elision of 
p and q if 
dimensions 
determined 
by 

compatibility. 


§1.13 


S.8 TRANSPOSITION 


Diagonal 

C <-B 

\ 

C <-B 

c i’ = 

Counter diagonal 

/ 

C <-B 


Horizontal 

C <-B 

t 

C <-B 

£^(B)+1 — < _ 

Vertical 

C<i _ 

Vector 

t t 
* -► * 

= *,+!-( 


S.9 SET OPERATIONS 


Intersection 

Difference 

Union 

Cartesian 

product 


c +-b n a 
c <-b a a 
c <-b u a 
C ^-b 1 ® 

• • • ® b n 


c=e b °lb §1.15 

c = V/6 
c = b © (a a b) 

C i+rf±<*-e>=( bfei i,6 fe 2 

dj = v(b>); 1 <hj< dj 
Clearly, i’(c) = n. and /((c) = x id 
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S.10 GENERALIZED MATRIX PRODUCT 


C 


A %\ B 


C *-A 6 b 
C <-a 6 B 
C «-<j O b 


C/ = Q 1 l(A i o 2 R,), where 0 2 produces a vector §1.11 
(i.e., is not the operator -L), and O x is a reduction 
operator (and hence C/ is a scalar). 

C l = {A 1 O 6), where O is any operator which 
produces a vector of dimension v(b). 

C 3 - — (a O R,), where O is any operator which 
produces a vector of dimension v(a). 

Cf = («, O *,). 


S.ll FILES 


File 

0 / 

A representation of a of the form § 1 

(Pi? °l5 Jp2? • * • > Pv(a) +1 > °> 

Pr(a)+2» - • ->Pv(p))> where p^ is the 
partition at position /?, p x = p v(p) = A, 
and (a 1 A c^/p X. 

Position file 

3 

e 

t 

Set file to position h. Called rewind 
if h = 1, and w/W if h = v(p). 

Record (from position h) 


Forward 

o®/ ^a,\ k 

a h +- <*, p h+1 X fc ; stop at position 
h 4- 1. Zero prescript may be elided 
and \ may be elided. 

Backward 

i®/ 

a h-i <*- a ; p,^ \; stop at 
position h — 1 . X x may be elided. 

Read (from position h) 


Forward 

a,b 0 0>/ 

a+-a h \ b*-p h+1 ; stop at position 
h 4- 1 . Associated branch is controlled 
by p h+1 , and may be elided. Zero 
prescript may be elided. 

Backward 

a, b fb* 

a <- /? Ph_ 1 ; stop at position 

h — 1 . Associated branch is con¬ 
trolled by P/^x and b may be elided. 

File array 

Full 

(I) 

Array of files <£/, for / e i 1 (/^(<T>)), 
j € iV(O)). 

Row 


Row of files 0/, for / e 

Column 


Column of files 0/, for / e i 1 t/<(0)). 

Compression 

Row 

a/<£ 

Selection as in corresponding opera¬ 

Column 

«//® 

tions on matrices. 
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S.12 TREES 


Path 1 

c *- A* 

c x is the i x th root of A; c j is the ijth §1.23 
node of the nodes on level j reach¬ 
able from node Cj_ x . 

Node i 

c*-{ \%i) 

The final node of path A*. 

Subtree i 

C +-A, 

C is the subtree of A rooted in node i. 

Component-by¬ 

component 

C A O B 

(C%*) = (A*)„(i) O (B f ) v(/) . 

Path reduction 

C - g/a 

Reduction by operator or relation O 
on nodes in left list order. 

Level reduction 

C <- OH A 

Reduction by operator or relation O 
on nodes in right list order. 

y-origin fe-index 

B ■<- b ij A 

(B'). (< ) = & ‘i((A'W 

y-origin mapping 

C*-bU A 

Rooted subtree C t - is a single null 


character node if b { 4 i ? '(jx 1 (A)); other¬ 
wise C i = A b ., where A is treated in a 
y-origin system. 


Full right list C <-] A 1 The rows of the index matrix a 2 /C are 

matrix the right (left) justified index vectors 

(with null fill to the common 
dimension v(A)) arranged in increasing 
Full left list C *-[ A order; C x and C 2 are the correspond- 

matrix J ing degree and node vectors of A. 

Right list matrix C a 2 /]Al The degree and node vector columns 
Left list matrix C a 2 /[Aj of the full right (left) list. 

Tree compression C +- U/A C is obtained from A by suppressing 

node i if node i of U is zero and re¬ 
connecting so that for each remain¬ 
ing pair of nodes, the one lies in the 
subtree rooted at the second if and 
only if it did so in A. 

Path compression C u/A C is obtained from A by suppressing 

all nodes on level j if Uj = 0, and re¬ 
connecting as in the compression U/A. 
Level com- C «//A C is obtained from A by suppressing 

pression rooted subtree A ; - if Uj = 0. 

Level mesh C «-\\A,u,B\\ u\\C — A; u//C = B 

Level mask C «-//A,u,B// u\\ C = «//A; u\\ C = u//B. 


Path catenation C A © B C is obtained by connecting roots of 

B to leaves of A, allotting successive 
groups of at most ^(B) A(A)] 

roots of B to each successive leaf of A. 
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S.12 TREES (continued) 


Full tree 

W - E 

Each node of W is unity and the § 1.23 

structure of W is determined by 
compatibility. 


W E(fe) 

Each node of W is unity; W is homo¬ 
geneous (i.e., all nodes on any level 
have a common degree) and v(W) = k. 

Zero tree 

W -I 

Each node of W is zero and the 


W -0 

structure of W is determined by 
compatibility. 


W «-!(*) 

Each node of W is zero; W is homo¬ 
geneous and v(W) = k. 

Path tree 

W *- u E 

u/W =0; u/W = E; structure of W 
determined by compatibility. 


W <- u E(fe') 

u/W =0; u/W = E; W is homo¬ 
geneous and v(W) = k. 

Level tree 

W <- u E 

u//W = 0, u//W = E; structure of W 
determined by compatibility. 


W < -„E(fe) 

u/l W =0; u//W = E; W is homo¬ 
geneous and v(W) = k. 

Maximization 

w-urA 

W = U A (A = wE), where m is the 

Minimization 

W -ULA 

maximum (minimum) over all nodes 
of U/A. 

Maximum path 

W ■*- a/U 

W is obtained from U by zeroing all 

prefix 


nodes of every subtree rooted in a 
zero node. 

Maximum path 

w <-«>/u 

W is obtained from U by zeroing 

suffix 


every node which contains a zero 
node in its subtree. 

Forward path 

W *-al A 

(W%o = 1 if (A%/) differs from all 

set selector 


preceding nodes of path A*. 

Backward path 

W -r/A 

(W*)„(i) = 1 if (A*) r( i) differs from all 

set selector 


other nodes of its subtree. 

Maximum level 
prefix 

W <— a//U 

c*7W = <x/€*/U 

Maximum level 
suffix 

W — a>//u 

€*/W = o>/c*/U 

Forward level 
set selector 

W - a//A 

€^/W = O'/C V A 

Backward level 
set selector 

W * t//A 

€ i/W = r/eVA 
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O-origin indexing, 71, 186 
O-residue, 14 
1-origin indexing, 12, 16 
1-residue, 14 

Abelian group, 69 
Accumulator, 78 
index, 73, 79, 83 
Active domain, 148, 153 
Add, clear and, 76 
Additive indexing, 73 
Address, 72 
leading, 106 

table sort, 176, 213, 231 
Address-ordered pool, 117 
Addressing, indirect, 32, 74, 76, 84 
system, open, 153 
Aids to serial sort, 204 
Algebra, matrix, 24 
Algorithm, Euclidean, 67 
Allocation, implicit, 108 
of memory, 105 
Alternator, 184, 188 
Amdahl, G. M., 153 
Amphisbaenic sort, 191, 195, 203 
Analysis in p, 160 
Analyzer, 159 
Arbitrary variable, 10, 13 
Argument, 10 
program, 159 

Arithmetic instructions, 84 
operations, 11 
Arrow, 2 

Ashenhurst, R. L., 188, 190, 191, 255, 258 
Assembler, 159 
Associative memory, 104 


Automatic programming, 71 
Auxiliary variables, 38 
Axis of transposition, 25 

Backward chain, 113 
Base address register, 74 
conversion, 190 
value, 27, 38, 146 
Batching, 138 
Betz, B. K., 198 
Binary operation, 13 
search, 141, 155, 221, 236 
Birkhoff, G., 26, 28, 32 
Biunique transformation, 144, 146 
Blaauw, G. A., 110 
Block chaining, 119 
sort, 191, 213 
transfer, 206, 213, 221 
Boehm, E., 153 
Boldface letters, 10 
Boolean function, 263 
Branch arrow, 2 
in files, 42, 181 
in a graph, 46 
instruction, 73, 80 
modification, 6 
Branching, 75 
ratio, 47 

Brooks, F. P., 129 

Bubble sort, 2l3, 217, 229, 233, 239 

Bucket file, 154 

Burks, A. W., 53, 122, 163, 164 
Busy indicator, 94 

Calingaert, P., 252 

Canonical form, 247, 249, 255, 261 
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Cap, 29 

Carr, J. W., 171 
Carter, W. C., 198, 203, 244 
Cartesian product, 30 
Cascade sort, 191, 198, 203 
Category, 189 
Catenated scan, 135, 139 
Catenation, 20, 21, 29 
Ceiling function, 12 
Central register, 72, 78 
Chain, 113, 117 
filial, 128 

list matrix, 121, 126 
Chained representation, 110, 116, 135, 
147, 221 
stack, 113 
Chaining block, 119 
end-around, 112, 135 
vector, 155 
Change sign, 76 
Channel-unit interlock, 95 
Channel, command, 88 
data, 88 

input-output, 75, 95 
instruction, 100 
trap, 75, 97 
Character, null, 31, 58 
Characteristic matrix, 258 
Characteristic vector, 29, 249, 253, 261 
Check, parity, 98 
Chien, R. T., 154 
Church, Robert, 239 
Circuit, 46 
Class of operand, 8 
Classification, 177 

and simple merge, 178, 186, 201, 211 
simple, 177, 205 
string, 183 
Clear and add, 76 
Clustering of keys, 154 
Coalescence, 144, 148, 152 
Code, Gray, 63 
Huffman, 51 
operation, 72 
point, 124 
variable length, 124 
Column-chained matrix, 121 
Column, compression, 22 
dimension, 14 
expansion, 21 


Column, list, 22, 108, 119 
mapping, 33 
mask, 21 
mesh, 20 
of files, 43 
operation, 8, 18 
reduction, 17 

sort, 178, 186, 191, 193, 204, 211 
vector, 14 
Command, 71 
channel, 88 
register, 73 
Comparison, 2 

of internal sorting, 239 
with zero, 5 

Compatibility, first-order, 171 
in compression, 18 
of trees, 57 
of vectors, 16 
Compiler, 159 
Components, 13, 22 
Compound operation, 8 
statement, 160, 165 
Compression, 18, 21, 110, 113 
tree, 54, 60 
Computer files, 44 
instruction, 71 
program, 71 
Congruence, 12 

Conjunctive canonical form, 249 
Connection matrix, 46 
Contained in an ordering, 190 
Control register, 72 
Controlled scan, 135, 141, 147 
Controlling parameter, 8 
Conventions, 7 
Conversion, base, 190 
Convert instructions, 87 
Copy operation, 177, 189 
Counter, instruction, 73 
Cramer, H., 138, 235 
Cup, 29 

Cycles, disjoint, 32 
Cyclic scan, 135 

Data channels, 88 
path, 101 

De Morgan’s law, 16, 24, 64, 103, 246 
Decisions, leading, 6 
Decomposition, 246, 255 
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Decomposition, chart, 257 
of a vector, 21 
Decrement, 78, 83 
Degree, 47 
vector, 49, 53, 163 
Derived key, 134, 144, 148 
Design, hardware, 71 
logical, 101 
system, 71 
Dewey, G., 129 
Dickson, L. E., 65 
Difference, set, 29 
Digital sort, 186 
Dimension, 5, 13, 38 
-ordered pool, 117 
column, 14 
elision of, 16 
in it, 106, 108 
Directed graph, 46 
scan, 135, 147 
Director, 159 
Disjoint cycles, 32 
vectors, 29 

Disjunctive canonical form, 249, 255 
Disordered pair, 233 
Dispersion, suffix, 173 
vector, 47, 59, 165 
Distributor, 78 
Domain, 10 
active, 148, 153 
Double chain, 113, 115 
Dummy item, 179 
variables, 40 

Duplicate keys, probability of, 201 
Dynamic relocation, 74 

Element, null, 6, 10, 16, 32 
of a matrix, 14 

Elementary operations, 11, 13 
statement, 160 
Eligible item, 179 
Elision, 5, 8, 11, 16, 32 
Enable, 75, 100 

Encoding matrix, 105, 109, 134 
End of file, 90 

End-around chaining, 112, 135 
Entry arrow, 3 

Equivalence canonical form, 249 
class, 134, 147 
function, 12 


Equivalence, of statements, 11 
reduction, 17 
Equivalent formulas, 166 
Euclidean algorithm, 67 
Euler's constant, 235 
Evaluation, of internal sorting, 231 
of serial sorting, 199 
Exchange sort, 212 
Exclusive disjunctive canonical form, 
249, 261 

Exclusive-or, 12 
reduction, 17 
Execute instruction, 83 
Execution, instruction, 72, 77 
sequence, 2, 5 
time of a sort, 199 
Exit arrow, 3 
Expansion, 19, 21 
Expected number, of strings, 200 
of transpositions, 232 

Falkoff, A., 104 
Families of operations, 8 
Fetch instruction, 72, 76 
mode indicator, 76, 95 
Fields, 205 
Fifo discipline, 117 
File, 40, 115, 135, 171, 176 
bucket, 154 
computer, 44 
coordinate, 177 
end of, 90 
rewind, 183 
stack, 172 
Filial chain, 128 
vector, 128 

Filial-heir chain list, 127 
Filler components, 22 
First-order compatibility, 171 
Fixed scan, 137 
Floor function, 12 
Flowcharting, 1 
Format vector, 105, 134 
Formula, 163 
equivalent, 166 
minimax form, 166 
Newton-Raphson, 64 
reduction of, 164 
singular, 164 
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Fortuitous function, 133 
Forward chain, 113, 117 
Fraction, scan, 135, 140 
Freiman, C. V., 154 
Friend, E. H., 233, 245 
Full list matrix, 50 
matrix, 22, 26 
vector, 15 

Function, Boolean, 263 
ceiling, 12 
equivalence, 12 
floor, 12 
fortuitous, 133 
identity, 253 
inverse, 37 
key, 176 
magnitude, 11 
mapping, 34 
program, 159, 169 
sign, 11 
trivial, 248 
unit, 248 
zero, 248, 253 

Gap, record, 98 
Garner, H. L., 67 
Gassner, B. J., 239 
Generator, 159 
Gilstad, R. L., 198 
Goldstine, H. H., 1, 178 
Graeffe method, 65 
Graph, directed, 46 
index, 48 

Gray code, 63, 104 
Greek letters, 9 
Grid matrix, 106, 118 
Group, abelian, 69 
rotation, 26 

Hanan, M., 154 

Harary, F., 132 

Hardware designer, 71 

Height of a tree, 47 

Homogeneous tree, 58, 121, 226 

Huffman code, 51 

Huffman, D. A., 123, 129, 132 

Identities, logical, 246 
Identity function, 253 
matrix, 26 


Identity, permutation vector, 33 
Immediate branching, 181 
Implicit allocation, 108 
specification, 38 
Inclusion, set, 28 
Index, 31 

accumulator, 73, 79, 83 
graph, 48 
list, 51, 60 

matrix, 49, 53, 73, 125 
origin, 16, 30, 32 
register, 73, 108 
systems, 14 
vector, 48, 60 
Indexability, 76 
Indexing, 0-origin, 71, 186 
1-origin, 12, 16 
additive, 73 
j-origin, 14, 43 
operation, 31, 36 
Indicator, busy, 94 
fetch mode, 95 
limit, 90 

Indirect addressing, 32, 74, 76, 84 

Infix vector, 16 

Information store, 72 

Initial node, 46 

Initialization, 6 

Input file, 41 

Input-output channel, 75, 95 
instructions, 88 
Insertion, of an instruction, 75 
ranking by, 213 
sort, 212 

Instruction, arithmetic, 84 
branch, 73, 80 
channel, 100 
computer, 71 
convert, 87 
counter, 73 
enable, 75 
execution, 72, 77 
fetch, 72, 76 
input-output, 88 
logical, 78, 83 
shift, 85 

Integral variable, 10, 13 
Interchange, 7 
Interchange sort, 232 
Interlock, channel-unit, 95 
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Internal sorting, 176, 204, 205, 212, 231, 
239 

Interpreter, 160 
Interruption, 75, 81 
Intersection, set, 28 
Interval vector, 15, 33 
Intrinsic vector, 248, 261 
Inverse characteristic vector, 253 
functions, 37 
mapping, 34 
permutation, 33, 139 
Italics, 10 
Items, 176 
dummy, 179 

eligible, 179, 181, 213, 229 
rela table, 177, 189 
Iteration, 4 

Iverson, K. E., 69, 129 

./-origin indexing, 14, 43 
Jacobsen, W. H., 146 
Jacobson, N., 28, 65, 69 
Johnson, L. R., 121, 148, 155 
Jordan box, 68 
elimination, 68 
Justification, 23 

Key, 133, 176 
clustering, 154 
derived, 134, 144, 148 
multiple, 143, 155 
transformation, 134, 144, 147 
vector, 176, 189 
Kronecker delta, 12 
Kunz, K. S., 64 

JB-formula, 163, 173 
minimax form, 166 
JB-notation, 163, 169 
JB-phrase, 163 
^-reduction, 164 
Lagrange multipliers, 226 
Lamb, S. M., 146 
Language, programming, 1 
structure, 7 
Last in first out, 117 
Leading address, 106 
decisions, 6 
Leaf, 46 
list, 122, 145 


Left list, 122 
rotation, 8 

Length, of a path, 46 
scan, 135, 148 
Level compression, 54, 60 
of a tree, 46 
tree, 57 

of structure, 39 
LIFO discipline, 117 
Limit indicator, 90 
Linear representation, 108, 110, 118 
List, chain, 121 
column, 22 
index, 51, 60 
leaf, 122, 145 
left, 122 

matrix, 49, 108, 121, 161 
processing, 110 
right, 226 
row, 22 ‘ 
vector, 122, 163 
well formed, 52 
Literals, 9 
Load and store, 78 
Logic, symbolic, 24 
Logical design, 101 
identities, 246 
instructions, 78, 83 
operations, 11 
trees, 60 
variable, 10, 13 
vector, 18 
Loop, 2 

Lukasiewicz, J., 160, 163 
Lukasiewicz notation, 122, 161, 169 

Maclane, S., 26, 28, 32 
Magnitude function, 11 
Mapping, 32, 34’, 58, 133 
Margenau, H., 69 
Marimont, R. B., 132 
Marked pool, 117 
Mask, 8, 19 

Matrices, representation of, 119 
vector of, 39 
Matrix, 13, 14 
algebra, 24 
chain list, 126 
characteristic, 258 
column-chained, 121 
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Matrix, component, 14 
connection, 46 
dimension, 5 
element, 14, 75 
enable, 100, 105 
encoding, 109, 134 
full, 22, 26 
grid, 106, 118 
identity, 26 
index, 49, 53, 73, 125 
list, 49, 108, 121, 161 
multiplication, 4 
product, 23, 33 
row-chained, 120 
sparse, 22, 120 
special, 26 
specific, 251 
superdiagonal, 26 
trace of, 27 
triangular, 26 
Mauchly, J. W., 178, 191 
Maximal path, 46 
Maximization, 36, 58 
Maximum prefix, 9, 22, 58 
suffix, 22, 58, 165 
McCracken, D. D., 195 
Member of a vector, 28 
Memory, 72 
allocation, 105 
Merge, 177 
simple, 177, 191, 205 
single-phase, 179, 183, 206 
sort, 178, 199, 204 
string-doubling, 186 
two-phase, 179 
Mesh, 19, 110 
Meta program, 71, 159 
Microprograms, 71 
Mid-square method, 154 
Minimax form of ^-formula, 166 
Minimization, 36, 58 
Minterm vector, 249, 257, 258 
Miscellaneous symbols, 10 
Mixed radix, 27 
Moment, of a tree, 47 
vector, 47, 60, 125 
Moore school, 178, 191 
Muller, D. E., 252 
Multiple keys, 143, 155 
scalar, 13 


Multiplication, elision of, 11 
generalization of, 11 
matrix, 4 

Multiplier-quotient register, 78 
Murphy, G. M., 69 

Nagler, H., 195, 242 
Newton-Raphson formula, 64 
Node, 46 

vector, 46, 49, 226 
Nonlinear representation, 109 
Normalization, 23 

Notation, Lukasiewicz, 122, 161, 169 
parenthesis, 161, 169 
parenthesis-free, 122, 163 
Polish, 122, 163 
summary, 7 
translation of, 169, 173 
Null character, 31, 58 
element, 6, 10, 16, 32 
scan, 25, 28 
vector, 16, 17 

Number base, choice of, 190 
Number system, 12, 14, 27 
Numerical variable, 10, 13 

Odd-even transposition sort, 213, 232 
Oettinger, A. G., 171 
Open addressing system, 153 
Operand, classes, 8 

structured, 8, 13, 45, 106 
Operation, arithmetic, 11 
binary, 13 
code, 72 
column, 8, 18 
compound, 8 
copy, 177, 189 
count, 212 
elementary, 11, 13 
families, 8 
indexing, 31, 36 
logical, 11 
mapping, 58 
ordering, 36 

rank-preserving, 177, 189, 231 

ranking, 31, 34, 133 

revision, 183 

row, 8, 18 

set, 28 

shift, 78 
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Operator, statement as, 160 
symmetric, 166 

Optimum m in ra-way sort, 200 
Optimum base for a sort, 202 
Ordered set, 23, 28 
tree, 45, 47 

Ordering, contained in, 190 
of statements, 5 
on *, 29 
operation, 36 
related, 189, 200, 202 
vector, 36, 176 
Output file, 41 
Outer product, 25 
Overflow indicator, 85 
positions, 78 
search, 149 
toggle, 83 

(P-notation, 163, 169 
Palermo, F. P., 154 
Parameter, controlling, 8 
Parenthesis, elision of, 8 
notation, 161, 169 
Parenthesis-free notation, 122, 163 
Parity, 12 
check, 98 

Partial pass sort, 191, 203 
Partition, 115 
file, 41, 181 
symbol, 115 

Partitioned representation, 115, 118 
Partitioning of a tree, 123 
Path compression, 56, 60 
data, 101 
maximal, 46 
tree, 57 

zero extension of, 61 
Permutation, 8, 32, 133, 139 
of a tree, 58 

vector, 33, 148, 176, 213, 231 
Peterson, W. W., 154 
Phase, 179 
Phister, M., 63, 104 
Phrase, 163 
Physical vector, 105 
Polish notation, 122, 163 
Polynomial, 27 
Polyphase sort, 199 
Pool, 113, 116 


Position file, 41 
Power of a merge sort, 199 
Pre-count column sort, 211 
Prefix, code, 123 
maximum, 9, 22, 58 
vector, 15 

Probability of duplicate keys, 201 
Product, Cartesian, 30 
matrix, 23, 33 
outer, 25 
Program, 2 
argument, 159 
computer, 71 
function, 159, 169 
in p, 160 
interaction, 7 
meta-, 71 
skeleton, 160 
statement, 2 
trace, 83, 160 
utility, 160 
vector, 7 

Programming, automatic, 71 
language, 1 
Pseudo-english, 1 

Q register, 78 

Queue disciplines, 117 

Quotes denoting literals, 10 

Radix exchange sort, 191, 212, 213 
mixed, 27 

Random-access sort, 176 
storage, 205 
Range, 10, 38, 159 

Rank-preserving operation, 177, 189, 231 
Rank, coordinate, 177 
of an array, 39 

Ranking, by insertion, 213, 236 
operation, 31, 34, 133 
sort, 213, 221, 235, 239 
Ratio, branching, 47 
storage, 205, 212, 231, 237 
Read file, 41 
Record, file, 41 
gap, 88, 98 
Reduction, 8, 16 
of a formula, 164 
of a tree, 58 
Register, central, 72 
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Register, command, 73 
index, 73, 108 
multiplier-quotient, 78 
sequence, 73 

Relatable items, 177, 189 
Related orderings, 189, 200, 202 
Relational statement, 11, 16 
Relocation, dynamic, 74 
Reordering, 32 

Repeated selection sort, 213, 223, 229, 
236 

Replacement sort, 213, 226, 239 
Representation, of variables, 105 
chained, 110, 116, 135, 147, 221 
linear, 108, 110, 118 
nonlinear, 109 
of matrices, 119 
of trees, 121 
partitioned, 115, 118 
solid, 106 
Residue, 12 

Revision operation, 183 
Rewind, 90 
file, 41, 183 
Richards, R. K., 63 
Right list, 226 
Ring, 101 
Riordan, J., 69 
Roman letters, 10 
Root, 46 
vector, 52 
Rooted scan, 135 
tree, 47 

Ross, I. C., 132 
Rotation, 33 
group, 26 
left, 8, 14 

Row-chained matrix, 120 
Row, compression, 18, 21 
dimension, 14 
expansion, 21 
list, 22, 108, 119 
mapping, 33 
mask, 21 
mesh, 20 
of files, 43 
operation, 8, 18 
reduction, 17 
vector, 14 
Run signal, 81 


Rutishauser, H., 69 

Samuel, A. L., 153 
Satellite vector, 155 
Scalar multiple, 13 
Scan, 133 

catenated, 135, 139 
controlled, 135, 141, 147 
cyclic, 135 
directed, 135, 147 
fixed, 137 
fraction, 135, 140 
length, 135, 148, 231 
null, 25, 28 

of equivalence class, 147 
rooted, 135 
shortest, 139 
to nearest terminal, 139 
Schay, G., 154 
Search, 133 

binary, 141, 221, 236 
overflow, 149 
single table, 149 
Selection, 8, 17, 32, 38 
set, 30 
vector, 22 
Selector set, 23, 58 
Self-indexing, 40 
Sequence, of execution, 2, 5 
register, 73 
Serial access, 41, 135 
sort, 176, 199, 204 
Set operations, 28 
selector, 23, 58 
Shaw, J. C., 110 
Shift instructions, 85 
operation, 78 
Sign function, 11 
Similar vectors, 29 
Simple classification, 177, 205 

classification and merge, 178, 200, 206 
merge, 177, 191, 205 
Single table search, 149 
Single-phase merge, 179, 183, 206 
vs two-phase, 200 
Singular formula, 164 
tree, 47, 60, 165 
Skeleton program, 160 
Skip, 75, 80 

Solid representation, 106 
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Sort, address table, 176, 213, 231 
amphisbaenic, 191, 195, 203 
block, 191, 213 
bubble, 213, 217, 233, 239 
cascade, 191, 198, 203 
column, 178, 186, 204 
digital, 186 
exchange, 212 
execution time, 199 
insertion, 212 

internal, 176, 204, 205, 239 
merge, 178, 204 
odd-even interchange, 232 
optimum base, 202 
partial pass, 191, 203 
polyphase, 199 
radix exchange, 191, 213 
random-access, 176 
ranking, 213, 221, 235, 239 
repeated selection, 213, 223, 229, 236 
serial-access, 176 
string-doubling, 211 
tournament, 237 
transposition, 212, 219, 236, 239 
two-phase column, 187, 193 
two-phase vs single-phase, 200 
with replacement, 213, 226, 239 
Sparse matrix, 22, 120 
Special logical trees, 60 
matrices, 26 
vectors, 15 
Specific matrix, 251 
vector, 249, 258 
Specification, arrow, 2 
implicit, 38 
program, 160 
Spread, 144, 146, 154 
Spruth, W. G., 154 
Stack, 113, 117 
vector, 166, 170 
file, 172 
Stage, 179, 231 

Statement, compound, 160, 165 
elementary, 160 
equivalence, 11 
number, 3 
ordering, 5 
program, 2 
relational, 11, 16 
Stirling’s formula, 240 


Storage, allocation in sorting, 209 
random-access, 205 
ratio, 205, 212, 231, 237 
Store, information, 72 
String-doubling merge, 186, 211 
Strings, 178 
classification, 183 
distribution of, 198 
expected number, 200 
Structure, levels of, 39 
of the language, 7 
Structured operands, 8, 13, 45, 106 
Subpath, 46 
Subroutines, 39 
Subtree, 47, 48, 51, 58 
Suffix, dispersion, 165, 173 
maximum, 22, 58 
vector, 15 

Summary of notation, 7 
Superdiagonal matrix, 26 
Supervisor, 159 
Symbolic logic, 24 
Symbols, miscellaneous, 10 
Symmetric operators, 166 
System designer, 71 

Tag vector, 79 
Tape units, 88 
Tensor, 39 
Terminal node, 46 
partition of a file, 41 
Tie, 10 

Toggle, enable, 75 
Tournament sort, 237 
Trace, of a matrix, 27 
program, 83, 160 
Transfer, 75, 80 
block, 213, 221 

Transformation, key, 134, 144 
spread, 154 

Translation of notation, 169, 173 
Translator, 159 
Transposition, 231 
axis, 25 

sort, 212, 219, 236, 239 
Trap, 75, 81, 97 
Tree compatibility, 57 
compression, 54, 60 
homogeneous, 58, 121, 223 
level, 46, 57 
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Tree, logical, 60 
moment, 47 
ordered, 45 
partitioning, 123 
path, 57 
permutation, 58 
reduction, 58 
representation, 121 
rooted, 47 

singular, 47, 60, 165 
uniform, 58, 145 
Triangular matrix, 26 
Trivial functions, 248 
Two-phase, column sort, 187, 193 
merge, 179 

vs single-phase sort, 200 

Uniform tree, 58, 145 
Union, 28 
Unit function, 248 
vector, 15, 30 
storage ratio, 212 
Univac, 106 
Utility program, 160 

Variable, 9 

arbitrary, 10, 13 
auxiliary, 38 
integral, 10, 13 
length code, 124 
logical, 10, 13 
numerical, 10, 13 
representation, 105 
Vector, 13 

catenation, 20, 21, 29 
chaining, 155 

characteristic, 29, 249, 253, 261 

coalescence, 148, 152 

degree, 49, 53, 163 

disjoint, 29 

dispersion, 47, 59 

filial, 128 

format, 105, 134 


Vector, full, 15 

identity permutation, 33 
index, 48, 60 
infix, 16 
interval, 15, 33 
intrinsic, 248, 261 
key, 176, 189 
list, 122, 163 
logical, 18 
mapping, 134 
minterm, 249, 257, 258 
moment, 47, 60, 125 
node, 46, 49, 226 
null, 16 
of matrices, 39 
ordering, 36, 176 
path, 46 

permutation, 33, 148, 176, 213, 231 

physical, 105 

prefix, 15 

program, 7 

representation, 78 

root, 52 

satellite, 155 

selection, 22 

similar, 29 

special, 15 

specific, 249, 258 

stack, 166, 170 

suffix, 15 

tag, 79 

unit, 15, 30 

Vocabulary, 163, 169, 170 
Von Neumann, J., 1, 178 

Well formation, 161 
Well formed, 52, 164 
Wind file, 41 
Woods, F. S., 235 
Word of memory, 72 
Wright, H. N., 12 

Zero, extension of a path, 61 
function, 248, 253 




